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INTRODUCTION: 


Type  1  Diabetes  (T1D)  is  associated  with  increased  risk  of  T1  D-Nephropathy  (T1DN)  and  is  usually 
accompanied  by  other  diabetic-related  complications  such  as  retinopathy,  neuropathy,  blood  pressure 
elevation,  and  high  risk  of  cardiovascular  morbidity  and  mortality.  Sixteen  million  people  in  the  US  have 
diabetes  with  800,000  new  cases  diagnosed  each  year.  Diabetic  complications  affect  most  diabetic  patients. 
Diabetes  occurs  in  men,  women,  children  and  the  elderly.  African,  Hispanic,  Native  and  Asian  Americans  are 
particularly  susceptible  to  its  most  severe  complications.  An  estimated  20%  to  40%  of  T1 D  patients  will  develop 
diabetic  nephropathy,  clinically  first  evidenced  by  microalbuminuria,  during  their  lifetime.  If  untreated  nearly  all 
T1D  patients  experiencing  microalbuminuria  will  progress  to  overt  nephropathy,  evidenced  by 
macroalbuminuria,  and  culminating  in  TID-End  Stage  Renal  Disease  (T1D-ESRD).  Improved  prediction  of  risk 
for  developing  diabetes  and  diabetic  complications  among  active  duty  members  of  the  military,  their  families 
and  retired  military  personnel  will  potentially  allow  focused  preventative  treatment  of  at-risk  individuals, 
providing  significant  healthcare  savings  and  improved  patient  well  being. 

BODY: 

Our  first  quarterly  scientific  progress  report  (08/27/07  -  1 1/30/07)  detailed  the  following  steps  forward 
in  reaching  the  aims  of  our  study. 

The  proposed  study  will  investigate  how  environmental  exposure(s)  and  genetic  variants 
interact  to  amplify  risk  for  Type  1  Diabetic  (T1D)  nephropathy  (T1DN)  and  substantially  increase 
incidence  of  end-stage  renal  disease  (T1D-ESRD).  The  work  that  will  be  performed  during  the 
proposed  2  year  study  will  exploit  the  results  of  our  previously  funded  application  to  the  DOD  entitled 
"New  Advanced  Technology  to  Improve  Prediction  of  Type  1  Diabetes  and  Its  Complications”,  the 
aims  of  which  are  nearing  completion.  The  previous  project  produced  3  principal  results:  1) 
Transmission/Disequilibrium  Testing  (TDT)  analysis  using  a  cohort  of  92  case  family  trios  for  the 
association  of  121  candidate  genes  with  T1DN,  identifying  significant  association  between  T1DN  and 
4  of  the  loci  tested;  2)  Recruitment  of  an  independent  cohort  of  2,881  participants,  consisting  of  1,174 
case/control  singletons  and  569  case/control  family  trios;  and  3)  a  genome-wide  association  scan 
(GWAS)  of  197  singleton  cases  and  197  controls  all  of  whom  were  non-smokers. 

The  currently  proposed  work  follows  up  on  the  significant  genetic  signals  identified  during  our 
previous  project. 

To  accomplish  our  aims  we  originally  proposed  the  following  tasks: 

Task  1.  Using  our  available  cohort  of  257  T1DN  (case)  family  trios  perform  an  additional 
series  of  GWAS  using  the  500,000  SNP  typing  microarray  in  order  to  confirm  the  strongest  genetic 
signals  associated  with  diabetic  kidney  disease. 

Task  2.  Recruit  an  additional  200  participants  with  T1  DN  who  have  never  smoked  cigarettes 
in  order  to  increase  the  total  number  of  singleton  case  participants  to  greater  than  280. 

Task  3.  Design  customized  microarrays  in  order  to  examine  the  significant  genetic  markers 
using  an  independent  cohort  of  singleton  cases  and  controls  matched  for  exposure  to  cigarette 
smoking.  The  available  cohort  for  this  stage  of  the  project  is:  Non-Smokers,  86  case  and  201  control; 
Smokers,  295  case  and  198  control. 

Task  4.  Perform  SNP  genotyping  of  the  most  significant  genetic  markers  using  an  available 
control  cohort  consisting  of  312  family  trios  matched  for  exposure  to  cigarette  smoking:  Non- 
Smokers,  216;  Smokers,  96. 

Task  5.  In  order  to  complete  the  analysis  of  the  genetic  and  environmental  exposure  data  for 
association  with  T1DN  we  will  compare  the  genetic  marker  frequency  (i.e. ,  SNPs,  Copy  Number 
Variation,  Haplotype,  Gene-Gene,  and  Gene-Environment  interactions)  with  occurrence  of  the 
disease. 

This  quarterly  report  provides  a  perfect  opportunity  to  present  an  update  of  our  progress  regarding 
the  T1D  genome-wide  scan  (GWAS).  The  technical  and  statistical  solutions  successfully  used  to 
confirm  the  genetic  regions  most  associated  T1 D  will  then  be  used  to  investigate  T1  DN  associations. 
A  cohort  of  422  T1  D  and  2,144  non-TID  participants  were  used  to  investigate  whether  it  would  be 
possible  to  identify  novel  SNPs  for  association  with  the  phenotype  (Table  1).  The  T1D  participants 
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were  recruited  from  the  Children's  Hospital  of  Pittsburgh  (N=28)  and  obtained  from  the  GoKinD  study 
(N=394).  Non-TID  participants  were  provided  by  the  KORA  and  POPGEN  studies  and  were 
received  as  computer  files  of  genotype  information.  All  participants,  whether  collected  in  Pittsburgh 
or  provided  by  KORA/POPGEN,  were  genotyped  using  the  Affymetrix  500K  GeneChip.  This  is  the 
same  SNP  typing  array  that  was  used  by  the  Wellcome  Trust  Case  Control  Consortium  (WTCCC). 
We  are  in  the  process  of  applying  for  access  to  the  WTCCC  dataset  and  anticipate  having  a  positive 
reply  before  February  2008. 


Table  1.  Characteristics  of  Case  and  Control  Participants. 


Case  Participants 

Control  Participants 

Demographic  Characteristics: 

CHP 

GoKinD 

KORA 

POPGEN 

Number  of  Singletons 

28 

394 

1644 

500 

European  American  (%) 

100% 

100% 

.... 

.... 

German  Resident  (%) 

.... 

.... 

100% 

100% 

Male  Gender  (%) 

50% 

46.7% 

49.5% 

51.8% 

History  of  Diabetes: 

Type  1  Diabetes  (%) 

100% 

100% 

.... 

.... 

Mean  Age  at  T1D  Diagnosis  (yr) 

12.7(7.9) 

12.2(7.1) 

.... 

.... 

Values  in  parentheses  indicate  standard  deviation  from  the  mean. 

As  expected,  we  observed  a  large  signal  corresponding  to  the  HLA  region  that  exceeded  genome¬ 
wide  significance  (Figure  1).  Other  notable  signals  were  observed  but  fell  short  of  the  Bonferroni 
correction  for  the  threshold  of  significance  (p-value  less  than  4x1  O'7).  Suggestive  signals  observed 
for  several  previously  identified  T1D  loci  are  summarized  in  Table  2.  For  example,  the 
PHTF1/PTPN22  loci  (p=1  xl  O'5),  CTLA4  (p=3x10'4),  and  IL2RA  (p=1x10'3)  are  commonly  considered 
as  confirmed  susceptibility  loci  associated  with  T1D.  Other  suggested  T1D  susceptibility  loci  (e.g., 
ITPR3,  NOD2,  and  PDX1)  were  also  identified  by  the  GWAS.  In  contrast,  the  confirmed  loci  CD25, 
IFIH1,  and  INS  were  not  readily  observed  as  nearby  SNPs  did  not  exceed  a  p-value  of  0.001.  This 
was  possibly  the  result  of  the  Affymetrix  GeneChip  not  including  a  sufficient  number  of  SNPs  in 
linkage  disequilibrium  with  the  implicated  alleles. 


fMatch 


Chromosome  Location 

Figure  1.  Summary  of  the  GWAS  data  obtained  for  T1D.  The  x-axis  is  chromosome  and  location  while  the  y-axis  is  the  - 
Log  of  the  p-value.  As  expected  the  HLA  region  on  chromosome  6p  attained  genome-wide  significance.  All  other  p- 
values  exceeding  1x10  7  have  been  removed  from  consideration  due  to  poor  quality  genotyping  data.  The  complete  list  of 
GWAS  data  is  included  in  the  Appendix. 
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Table  2.  Confirmed  and  suggested  T1D  loci  identified  during  the  T1D  GWAS. 


Chr 

Locus 

Location 

dbSNP  ID 

Dominant 

Additive 

Recessive 

1 

PHTF1 

114,045,700 

rs  1230649 

1.2E-02 

7.2E-04 

1.0E-01 

1 

PHTF1 

114,105,331 

rs6679677 

3.3E-05 

1.0E-05 

4.4E-02 

1 

PTPN22 

114,216,891 

rs2488457 

3.8E-03 

7.1E-04 

1.2E-01 

2 

CTLA4 

204,428,384 

rsl  1571292 

2.5E-02 

4.0E-03 

2.6E-04 

2 

CTLA4 

204,449,111 

rs231726 

7.3E-03 

3.4E-04 

5.2E-04 

2 

CTLA4 

204,449,411 

rs  1427676 

6.5E-03 

4.5E-04 

6.0E-04 

6 

ITPR3 

33,650,501 

rs9296095 

2.2E-04 

3.1E-03 

1.0E-02 

6 

ITPR3 

33,682,987 

rs444697 

2.5E-04 

3.9E-05 

5.1E-04 

10 

IL2RA 

6,132,099 

rsl0905669 

6.2E-02 

1.0E-03 

3.9E-02 

13 

PDX1 

27,374,640 

rs9554196 

6.5E-04 

1.8E-02 

9.5E-03 

13 

PDX1 

27,377,980 

rs93 19399 

3.8E-04 

1.6E-02 

7.5E-03 

13 

PDX1 

27,378,356 

rs9551419 

7.4E-04 

2.2E-02 

9.5E-03 

16 

NOD2 

49,281,588 

rs9933594 

9.1E-04 

1.3E-02 

6.3E-03 

Table  3  lists  select  data  obtained  from  the  GWAS.  Of  the  loci  identified,  there  were  12  that  are  currently  listed  by 
T1  DBase  (http://t1dbase.org)  as  candidates  for  T1D.  Moreover,  there  are  a  number  of  novel  loci  that  may  be 
interesting  to  evaluate  during  a  staged  investigation  of  the  phenotype.  For  example,  the  GWAS  identified  loci 
involved  in  insulin  signaling  ( CRK  and  IRS2),  cytokine  signaling  ( IL6ST ,  IRF4,  TRAF3lp1)  and  potassium  ion 
transport  (KCNH7  and  KCNQ1).  Moreover,  the  GWAS  identified  a  subunit  of  mitochondrial  complex  1  ( NDUFA10 ) 
and  the  apoptosis  regulatory  factor  FKSG2  that  may  influence  superoxide  production  and  subsequent  susceptibility 
to  programmed  cell  death. 

Table  3.  Select  loci  observed  during  the  T1 D  GWAS. 


Chr 

Locus 

Location 

dbSNP  ID 

Dominant 

Additive 

Recessive 

T1  DBase  Candidate  Loci: 

1 

IL12RB2 

67,612,161 

rs3828069 

5.7E-03 

9.3E-03 

9.8E-04 

1 

PHTF1/PTPN22 

114,105,331 

rs6679677 

3.3E-05 

1.0E-05 

4.4E-02 

2 

CTLA4 

204,428,384 

rsl  1571 292 

2.5E-02 

4.0E-03 

2.6E-04 

3 

SENP7 

102,504,337 

rsl  6843699 

8.5E-04 

7.5E-04 

6.8E-02 

6 

ITPR3 

33,682,987 

rs444697 

2.5E-04 

3.9E-05 

5.1  E-04 

10 

IL2RA 

6,132,099 

rsl  0905669 

6.2E-02 

1.0E-03 

3.9E-02 

11 

LRP5 

67,813,346 

rsl  1606508 

2.2E-03 

3.1  E-03 

9.4E-04 

12 

CD4 

6,775,811 

rsl  075836 

6.0E-02 

2.7E-04 

5.9E-02 

12 

TRAFD1 

111,002,025 

rsl  980364 

9.6E-04 

9.4E-03 

5.3E-04 

12 

PTPN11 

111,462,160 

rsl  0850053 

6.7E-04 

9.2E-03 

5.5E-04 

13 

PDX1 

27,377,980 

rs931 9399 

3.8E-04 

1.6E-02 

7.5E-03 

16 

NOD2 

49,281,588 

rs9933594 

9.1  E-04 

1.3E-02 

6.3E-03 

Novel  T1D  Candidate  Loci: 

2 

KCNH7 

163,442,376 

rsl  0203602 

1.1E-03 

2.4E-04 

3.7E-03 

2 

TRAF3IP1 

238,928,000 

rs821 804 

1.7E-05 

7.1  E-05 

4.4E-05 

2 

NDUFA10 

240,555,969 

rs4854067 

1.6E-03 

2.5E-02 

1.2  E-04 

5 

IL6ST 

55,388,527 

rsl  0042443 

2.2E-04 

1.4  E-05 

1.4  E-04 

6 

IRF4 

374,457 

rs950286 

5.2E-05 

6.7E-06 

4.8E-03 

8 

FKSG2 

36,899,367 

rsl  01 0669 

6.3E-02 

1.6E-02 

3.9E-05 

8 

EYA1 

72,362,997 

rs47381 23 

5.0E-03 

6.2E-05 

8.7E-04 

9 

IL6RL1 

91,744,788 

rs7864499 

2.0E-04 

5.1  E-04 

2.6E-03 

11 

KCNQ1 

2,781 ,804 

rsl  631 66 

4.4E-05 

1.7E-02 

3.3E-03 

13 

IRS2 

109,307,346 

rsl  041 637 

1.6E-03 

6.0E-06 

1.8  E-04 

15 

SEMA6D 

45,680,428 

rsl  6959754 

8.7E-06 

9.1  E-05 

6.3E-06 

17 

CRK 

1,278,556 

rs8070640 

2.7E-05 

9.4E-04 

5.7E-05 
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The  complete  list  of  SNPs  implicated  during  the  GWAS  has  been  included  in  the  Appendix.  That  file  contains 
the  chromosome  location  of  each  SNP  as  well  as  the  p-value  associated  with  the  3  major  modes  of  inheritance 
(i.e.,  additive,  dominant,  and  recessive).  The  Appendix  also  summarizes  the  quality  control  analysis  for  those 
SNPs  with  p-values  less  than  0.0001  and  indicates  which  SNPs  overlap  T1D  candidate  loci  that  have  been 
listed  by  T1  DBase.  This  is  the  principal  file  that  is  being  used  to  determine  which  signals  obtained  during  the 
GWAS  for  T1  D  should  be  studied  during  Stage  2  analysis  of  the  T1  D  results. 

There  are  expected  to  be  14,047  participants  available  for  the  T1D  project  (Table  4).  Our  repository  is 
anticipated  to  contain  a  modest  number  of  T1D  family  trios  (N=719),  T1D  singletons  (N=4,246),  and  non-TID 
control  singletons  (N=7,644).  Table  4  summarizes  the  available  cohorts  and  details  the  source  of  each 
participant,  and  whether  material  is  provided  as  DNA  or  as  a  computer  file  of  previously  determined  genotypes. 
When  genotypes  are  provided  they  were  obtained  using  the  same  Affymetrix  500K  GeneChip  that  was  used 
during  our  GWAS  for  T 1  D. 

Table  4.  Summary  of  cohort  available  and  anticipated  for  T1 D  analysis. 


Family  Trios 

Case  and  Control  Singletons 

Source 

T1 D  Familv  Trios 

Case 

Control 

Comment 

EMIL 

150 

300 

2500 

DNA  available 

GoKinD 

569 

1746 

DNA  available 

KORA 

1644 

GeneChip  data  available 

POPGEN 

500 

GeneChip  data  available 

Pittsburgh 

200 

DNA  available 

WTCCC 

2000 

3000 

GeneChip  data  available 

At  this  stage  of  the  project  we  need  to  prioritize  the  SNPs  identified  during  the  GWAS  and  then  evaluate 
suggestive  SNPs  using  the  T1D  family  trios.  The  University  of  Pittsburgh  Genomics  Core  Laboratory  has 
recently  acquired  the  Sequenom  SNP  typing  platform.  The  Sequenom  system  provides  sufficient  throughput 
and  genotyping  accuracy  for  projects  such  as  ours.  The  fee  for  using  this  system  has  not  yet  been  finalized 
but  is  anticipated  to  be  about  10  cents  per  SNP.  This  is  about  one-fifth  the  fee  associated  with  genotyping 
using  TaqMan  methodology.  It  is  anticipated  that  when  working  with  T1D  family  trios  we  will  genotype  2,157 
participants  (N=3x719)  for  about  45  SNPs  at  a  cost  of  roughly  $9,700.  Choosing  the  45  SNPs  to  genotype  is 
the  aim  that  needs  to  be  completed  in  the  near  future. 

The  next  phase  of  analysis  will  have  2  goals.  The  first  being  to  measure  the  p-values  for  confirmed  and 
suggested  T1D  loci  and  the  second  being  to  test  whether  the  signals  associated  with  novel  T1D  loci  are 
reproducible. 


12.  Statement  of  Plans  and  Milestones  for  the  Next  Quarter 

There  are  2  goals  (Goal  1  and  2)  that  we  anticipate  completing  and  1  goal  (Goal  3)  that  will  be  initiated  during 
the  next  research  period. 

Goal  1.  Increase  the  number  of  participants.  Milestone  1  A.  Apply  to  the  WTCCC  for  access  to  the  T1 D  and 
control  cohorts.  Milestone  1 B.  Continue  to  recruit  T 1 D  case  trios  and  singletons  from  Pittsburgh. 

Our  application  to  the  WTCCC  for  access  to  their  cohort  of  T1D  (N=2,000)  and  non-TID  control  (N=3,000) 
singletons  has  been  finalized.  The  project  title  is  "Detection  of  Susceptibility  Loci  in  Multifactorial  Disease". 
The  principal  aim  is  to  combine  our  original  GWAS  results  with  the  GWAS  performed  by  WTCCC  followed  by 
statistical  genetic  analysis.  It  is  anticipated  that  the  outcome  will  be  an  analysis  with  increased  power  to  detect 
true  signals  for  association  with  T1D  risk.  We  will  also  continue  to  recruit  T1D  participants  through  our 
collaboration  with  the  University  of  Pittsburgh  Transplantation  Institute.  Samples  collected  from  Pittsburgh  will 
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be  used  to  increase  the  size  of  our  DNA  repository  enabling  increased  power  of  the  genetic  study  to  detect  true 
association  signals. 


Goal  2.  Determine  high  priority  SNPs  to  study.  Milestone  2A.  Identify  confirmed  and/or  suggestive  T1D  risk 
loci  using  our  original  GWAS.  Milestone  2B.  Identify  novel  loci  to  evaluate  for  T1 D  risk. 

Data  obtained  during  our  original  GWAS  will  be  compared  with  the  list  of  confirmed  and  suggested  T1D  risk 
loci  in  order  to  identify  where  the  different  datasets  overlap.  Moreover,  novel  SNPs  identified  during  the 
original  GWAS  will  be  complied  based  upon  the  following  criteria:  i)  SNPs  with  p-values  less  than  0.0001;  ii) 
SNPs  with  p-values  less  than  0.001  occurring  within  50,000bp  of  a  known  locus;  and  iii)  Clusters  of  3  or  more 
SNPs  with  p-vlaues  less  than  0.001  occurring  within  50,000bp  of  one  another.  The  priority  SNPs  will  be  used 
to  compile  a  list  that  will  direct  subsequent  laboratory  analyses. 


Goal  3.  Initiate  genotyping  of  priority  SNPs.  Milestone  3A.  Use  the  cohort  of  T1D  family  trios  to  evaluate 
confirmed  SNPs  for  T1D  risk.  Milestone  3B.  Use  the  cohort  of  T1D  family  trios  to  evaluate  novel  SNPs  for 
T1  D  risk. 

Our  DNA  repository  includes  a  collection  of  T1D  family  trios  (N=719)  that  can  be  used  to  evaluate  the  SNPs 
identified  during  Goal  2.  SNPs  will  be  chosen  from  the  list  of  confirmed  and  suggestive  T1D  risk  as  well  as 
from  the  list  of  novel  genetic  variants.  The  genotype  of  each  SNP  will  be  determined  among  the  various 
participants  and  the  results  will  be  evaluated  by  transmission  disequilibrium  testing  to  determine  association 
with  T1  D. 

In  our  second  quarterly  scientific  progress  report  (12/01/07  -  02/29/08,  we  presented  the  following  data: 

Type  1  Diabetes  Genome-Wide  Association  Study  Update: 

The  previous  quarterly  report  summarized  our  progress  regarding  the  genome-wide  association  (GWA)  scan 
for  Type  1  Diabetes  (T1  D)  susceptibility  alleles.  We  have  reported  the  genotype  results  from  a  cohort  of  422 
T1D  (case)  and  2,144  non-TID  (control)  participants  using  322,347  SNPs.  The  anthropometric  properties  of 
the  case  and  control  cohort  has  been  described  in  the  previously  quarterly  report  but  is  reproduced  below  for 
convenience. 


Table  1.  Characteristics  of  Case  and  Control  Participants. 


Case  Participants 

Control  Participants 

Demographic  Characteristics: 

CHP 

GoKinD 

KORA 

POPGEN 

Number  of  Singletons 

28 

394 

1644 

500 

European  American  (%) 

1 00% 

1 00% 

— 

— 

German  Resident  (%) 

— 

— 

1 00% 

1 00% 

Male  Gender  (%) 

50% 

46.7% 

49.5% 

51.8% 

History  of  Diabetes: 

Type  1  Diabetes  (%) 

1 00% 

1 00% 

— 

.... 

Mean  Age  at  T1 D  Diagnosis  (yr) 

12.7(7.9) 

12.2(7.1) 

.... 

.... 

Values  in  parentheses  indicate  standard  deviation  from  the  mean. 

As  outlined  in  the  following  sections,  the  research  goals  for  the  first  quarter  of  2008  were  3-fold:  1)  Increase  the 
number  of  participants;  2)  Determine  high  priority  SNPs  to  study;  and  3)  Initiate  genotyping  of  high  priority 
SNPs.  Our  progress  for  each  goal  is  addressed  below. 
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Goal  1.  Increase  the  number  of  participants.  Milestone  1A.  Apply  to  the  Wellcome  Trust  Case  Control 
Consortium  (WTCCC)  for  access  to  the  T1D  and  control  cohorts.  Milestone  IB.  Continue  to  recruit  T1D  case 
trios  and  singletons  from  Pittsburgh. 

In  Milestone  1A  we  proposed  to  increase  the  number  of  participants  in  our  GWA  study  by  applying  to  the 
WTCCC  for  access  to  their  GWA  dataset.  The  WTCC  dataset  contains  SNP  genotypes  collected  from  2,000 
T1D  and  3,000  non-TID  participants.  The  WTCCC  and  our  group  used  identical  Affymetrix  manufactured 
genotyping  arrays  to  evaluate  risk  for  T1D.  Therefore,  SNPs  evaluated  by  the  WTCCC  and  Pittsburgh  GWA 
studies  are  essentially  the  same.  As  a  result,  the  dataset  collected  at  Pittsburgh  and  by  the  WTCCC  can  be 
combined  in  order  to  increase  the  overall  size  of  the  study  thus  replicating  the  study  while  simultaneously 
improving  the  power  of  the  study  to  identify  T1  D  susceptibility  alleles.  Once  the  application  for  data  sharing  is 
approved  by  the  WTCCC  committee  that  evaluates  requests,  our  analysis  of  the  WTCCC  dataset  can  begin 
immediately.  The  application  is  complete,  pending  final  signatures  from  the  University  of  Pittsburgh  Office  of 
Grants  and  Research,  and  we  anticipate  submission  to  the  WTCCC  will  occur  during  the  upcoming  research 
quarter.  A  copy  of  the  application  is  attached  as  an  Appendix  to  this  quarterly  report. 

In  Milestone  IB,  we  proposed  to  continue  recruitment  of  T1D  case  trios  and  singletons  from  the  University  of 
Pittsburgh  Medical  Center  (UPMC).  Recruitment  of  new  participants  is  one  of  the  primary  focuses  of  the 
project  and  patients  will  be  recruited  continuously  during  the  project's  lifetime.  The  new  participants  recruited 
during  Goal  1  will  be  used  to  confirm  the  genetic  signals  identified  from  the  cohort  described  in  Table  1  that 
was  used  during  our  previously  completed  GWA.  We  can  follow  the  health  of  these  patients  at  multiple  times 
during  the  project  since  our  IRB  approved  protocol  allows  us  to  contact  formerly  recruited  human  subjects  for 
the  purpose  of  updating  their  health  information  and,  when  appropriate,  collecting  a  second  vial  of  blood. 
Table  2  summarizes  the  current  collection  of  T1 D  participants  recruited  in  Pittsburgh. 


Table  2.  Characteristics  of  Singleton  and  Family  Trios  Participants  Recruited  at  Pittsburgh. 


Pittsburgh  Cohorts 

GoKinD  Cohort 

Demographic  Characteristics: 

Sinqletons 

Family  Trios 

Family  Trios 

Number  of  Participants 

46 

26 

569 

European  American  (%) 

100 

100 

100 

History  of  Diabetes: 

Type  1  Diabetes  (%) 

100 

100 

100 

Mean  Age  at  T1 D  Diagnosis  (yr) 

14(7) 

14(7) 

11(6) 

Values  in  parentheses  indicate  standard  deviation  from  the  mean.  Pittsburgh  samples  were  recruited  at  the 
University  of  Pittsburgh  Medical  Center.  GoKinD  participants  were  recruited  by  the  Genetics  of  Kidneys  in  Diabetes 
Study. 


Goal  2.  Determine  high  priority  SNPs  to  study.  Milestone  2 A.  Identify  confirmed  and/or  suggestive  T1D  risk 
loci  using  our  original  GWA  study.  Milestone  2B.  Identify  novel  loci  to  evaluate  for  T1D  risk. 

We  have  completed  Milestones  2A  by  evaluating  our  T1D  GWA  data  against  a  list  of  known  and  suspected 
T1  D  risk  loci.  The  identities  of  the  T1  D  risk  loci  were  obtained  from  T1  Dbase  (http://t1dbase.org)  a  world-wide- 
web  accessible  database  of  T1D  genetic  risk  information.  The  genomic  location  of  these  confirmed  T1D 
susceptibility  loci  were  compared  with  the  location  of  SNPs  that  during  the  previously  completed  GWA 
associated  with  p-values  exceeding  0.001.  The  analysis  identified  12  strongly  associated  SNPs  occurring 
within  50Kb  of  T1 D  candidate  loci.  SNPs  occurring  near  confirmed  T1  D  loci  are  summarized  in  the  Table  3. 

Table  3.  Summary  of  GWA  Results  for  SNPs  located  within  50Kb  of  confirmed  T1 D  Susceptibility  Loci. 


Chr 

dbSNP  ID 

Location 

Locus 

Official  Full  Name 

1 

RS6679677 

114,105,331 

PHTF1 

Putative  homeodomain  transcription  factor  1 

2 

Rs231 726 

204,449,111 

CTLA4 

Cytotoxic  T-lymphocyte-associated  protein  4 

2 

Rsl 427676 

204,449,411 

CTLA4 

Cytotoxic  T-lymphocyte-associated  protein  4 

7 

Rsl 0232298 

127,006,653 

PAX4 

Paired  box  4 

9 


10 

Rsl 0905668 

6,132,061 

IL2RA 

10 

Rsl 0905669 

6,132,099 

IL2RA 

10 

RS7074372 

6,228,597 

IL2RA 

12 

Rsl 11 71 739 

54,756,892 

ERBB3 

13 

RS3812863 

27,443,268 

PDX1 

16 

RS9746695 

11,115,395 

CLEC16A 

16 

Rsl 1640295 

11,298,229 

CLEC16A 

20 

RS2425756 

44,223,669 

CD40 

Interleukin  2  receptor,  alpha 
Interleukin  2  receptor,  alpha 
Interleukin  2  receptor,  alpha 

v-erb-b2  erythroblastic  leukemia  viral  oncogene  homolog  3 
(avain) 

Pancreatic  and  duodenal  homeobox  1 
C-type  lectin  domain  family  16,  member  A 
C-type  lectin  domain  family  1 6,  member  A 


CD40  molecule,  TNF  receptor  superfamily  member  5 
Signals  from  the  HLA  region  on  chromosome  6  are  excluded  from  the  information  presented  in  the  table. 


Milestone  2B  has  also  been  completed  during  the  last  quarterly  research  period.  The  GWA  scan  was 
evaluated  to  identify  SNPs  that  implicated  novel  genetic  loci  for  further  investigation.  As  summarized  in  Table 
4  there  were  24  loci  identified  by  SNPs  that  had  p-values  exceeding  the  10th  percentile  of  all  association 
signals  garnered  during  the  previously  completed  GWA.  For  several  loci,  such  as,  KCNH7,  STXBP5,  and 
RXFP2  there  were  clusters  of  5  or  more  SNPs  covering  genomic  regions  from  64Kb  and  up  to  247Kb.  These 
loci,  as  well  as  others  with  strong  evidence  for  association  are  included  in  Table  4. 

Table  4.  Summary  of  GWA  Results  for  SNPs  located  within  50Kb  of  Novel  Loci. 


Chr 

dbSNP  ID 

Location 

Locus 

Official  Full  Name 

1 

rsl  0858046 

114,875,303 

AMPD1 

Adenosine  monophosphate  deaminase  1  (isoform  M) 

1 

rs6689415 

114,908,992 

AMPD1 

Adenosine  monophosphate  deaminase  1  (isoform  M) 

1 

rs2268697 

115,029,016 

AMPD1 

Adenosine  monophosphate  deaminase  1  (isoform  M) 

1 

rs6427093 

165,823,621 

CREG1 

Cellular  repressor  of  El  A-stimulated  genes  1 

1 

rs2094028 

194,417,119 

KCNT2 

Potassium  channel,  subfamily  T,  member  2 

1 

rsl  6839682 

194,497,622 

KCNT2 

Potassium  channel,  subfamily  T,  member  2 

2 

rsl  2470503 

163,405,297 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  H  (eag- 
related),  member  7 

2 

rsl  001 911 

163,421,846 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  H  (eag- 
related),  member  7 

2 

rsl  492990 

163,422,123 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  H  (eag- 
related),  member  7 

2 

rsl  0203602 

163,442,376 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  H  (eag- 
related),  member  7 

2 

rsl  364562 

163,449,972 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  H  (eag- 
related),  member  7 

2 

rs307869 

163,469,837 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  H  (eag- 
related),  member  7 

2 

rs2028201 

201,011,255 

KCTD18 

Potassium  channel  tetramerisation  domain  containing  18 

2 

rs821804 

238,928,000 

TRAF3IP1 

TNF  receptor-associated  factor  3  interacting  protein  1 

2 

rs523437 

239,063,548 

TRAF3IP1 

TNF  receptor-associated  factor  3  interacting  protein  1 

2 

rs580209 

239,065,169 

TRAF3IP1 

TNF  receptor-associated  factor  3  interacting  protein  1 

5 

rsl  0042443 

55,388,527 

IL6ST 

Interleukin  6  signal  transducer  (gp130,  oncostatin  M 
receptor) 

6 

rs950286 

374,457 

IRF4 

Interferon  regulatory  factor  4 

6 

rs4959880 

387,206 

IRF4 

Interferon  regulatory  factor  4 

6 

rsl  321 4605 

394,483 

IRF4 

Interferon  regulatory  factor  4 

6 

rs2498587 

118,133,432 

GOPC 

Golgi  associated  PDZ  and  coiled-coil  motif  containing 

6 

rs9322085 

147,354,835 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 

6 

rs7765767 

147,366,631 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 

6 

rsl  52901 7 

147,392,995 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 

6 

rs6919814 

147,412,758 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 

6 

rs6920096 

147,412,791 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 

6 

rsl  881 660 

147,420,052 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 

6 

rs6930740 

147,438,438 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 

6 

rs6920901 

147,470,991 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 
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7 

rsl  294610 

7,332,182 

C1GALT1 

7 

rsl  294651 

7,381,742 

C1GALT1 

7 

rsl  404829 

144,255,618 

TPK1 

9 

rs2808784 

115,961,345 

COL27A1 

9 

rs2567724 

115,961,754 

COL27A1 

9 

rsl  0982093 

115,976,726 

COL27A1 

10 

rs2803590 

88,362,973 

BMPR1A 

10 

rs2803568 

88,426,595 

BMPR1A 

11 

rs574434 

100,384,527 

PGR 

12 

rs7305099 

845,537 

WNK1 

12 

rsl  282801 6 

868,626 

WNK1 

13 

rs7328020 

30,310,128 

ALOX5AP 

13 

rs4941 903 

31,140,471 

RXFP2 

13 

rs7336382 

31,146,759 

RXFP2 

13 

rs3950017 

31,151,027 

RXFP2 

13 

rs641 696 

31,376,444 

RXFP2 

13 

rs20341 5 

31,387,796 

RXFP2 

13 

rsl  041 637 

109,307,346 

IRS2 

14 

rsl  873274 

62,287,052 

KCNH5 

15 

rs4779798 

27,197,881 

APBA2 

15 

rs2676083 

31,397,903 

RYR3 

15 

rs25961 81 

31,438,457 

RYR3 

18 

rs9959583 

20,993,778 

ZNF521 

18 

rs8085678 

20,998,828 

ZNF521 

18 

rsl  971613 

21,589,323 

ZNF521 

19 

rs687136 

45,366,831 

AKT2 

19 

rs687155 

45,366,846 

AKT2 

Core  1  synthase,  glycoprotein-N-acetylgalactosamine  3 
beta-galactosyltransferase,  1 

Core  1  synthase,  glycoprotein-N-acetylgalactosamine  3 

beta-galactosyltransferase,  1 

Thiamin  pyrophosphokinase  1 

Collagen,  type  XXVII,  alpha  1 

Collagen,  type  XXVII,  alpha  1 

Collagen,  type  XXVII,  alpha  1 

Bone  morphogenetic  protein  receptor,  type  1A 

Bone  morphogenetic  protein  receptor,  type  1A 

Progesterone  receptor 

WNK  lysine  deficient  protein  kinase  1 

WNK  lysine  deficient  protein  kinase  1 

Arachidonate  5-lipoxygenase-activating  protein 

Relaxin/insulin-like  family  peptide  receptor  2 

Relaxin/insulin-like  family  peptide  receptor  2 

Relaxin/insulin-like  family  peptide  receptor  2 

Relaxin/insulin-like  family  peptide  receptor  2 

Relaxin/insulin-like  family  peptide  receptor  2 

Insulin  receptor  substrate  2 

Potassium  voltage-gated  channel,  subfamily  H  (eag- 
related),  member  5 

Amyloid  beta  (A4)  precursor  protein-binding,  family  A, 
member  2  (XII -like) 

Ryanodine  receptor  3 
Ryanodine  receptor  3 
Zinc  finger  protein  521 
Zinc  finger  protein  521 
Zinc  finger  protein  521 


v-akt  murine  thymoma  viral  oncogene  homolog  2 
v-akt  murine  thymoma  viral  oncogene  homolog  2 
Signals  from  the  HLA  region  on  chromosome  6  are  excluded  from  the  information  presented  in  the  table. 


Goal  3.  Initiate  genotyping  of  priority  SNPs.  Milestone  3 A.  Use  the  cohort  of  T1D  family  trios  to  evaluate 
confirmed  SNPs  for  T1D  risk.  Milestone  3B.  Use  the  cohort  of  T1D  family  trios  to  evaluate  novel  SNPs  for 
TIDrisk. 

As  outlined  in  Goal  3  Milestones  3A  and  3B  we  have  begun  the  process  of  evaluating  SNPs  selected  from 
those  shown  in  Tables  3  and  4  in  order  to  evaluate  interesting  SNPs  near  confirmed  T1D  loci  (Milestone  3A) 
and  SNPs  occurring  near  interesting  genes  (Milestone  3B)  for  risk  of  T1D.  The  17  SNPs  listed  in  Table  5 
indicate  those  that  have  been  selected  for  this  step.  At  this  stage  of  the  process  we  have  acquired  the 
necessary  reagents  for  PCR  amplification  and  iPlex  (Sequenom,  Inc.  San  Diego,  CA)  style  genotyping  of  DNA 
samples.  The  DNA  samples  are  taken  from  our  repository  of  affected  (T1D)  family  trios.  The  data  will  be 
evaluated  initially  by  TDT  analysis. 

Table  5.  SNPs  selected  for  iPlex  Based  Genotyping. 


Chr 

dbSNP  ID 

Location 

Locus 

Official  Full  Name 

1 

rsl  0858046 

114,875,303 

AMPD1 

Adenosine  monophosphate  deaminase  1  (isoform  M) 

1 

rsl  091 791 3 

162,343,920 

MGST3 

Microsomal  glutathione  S-transferase  3 

1 

rs7555369 

162,344,746 

MGST3 

Microsomal  glutathione  S-transferase  3 

1 

rsl  6839682 

194,497,622 

KCNT2 

Potassium  channel,  subfamily  T,  member  2 

2 

rsl  2470503 

163,405,297 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  H  (eag-related), 

member  7 
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2 

rsl  00191 1 

163,421,846 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  FI  (eag-related), 
member  7 

2 

rsl  492990 

163,422,123 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  FI  (eag-related), 
member  7 

2 

rsl  0203602 

163,442,376 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  FI  (eag-related), 
member  7 

2 

rsl  364562 

163,449,972 

KCNH7 

Potassium  voltage-gated  channel,  subfamily  FI  (eag-related), 
member  7 

5 

rsl  0042443 

55,388,527 

IL6ST 

Interleukin  6  signal  transducer  (gp130,  oncostatin  M  receptor) 

6 

rsl  321 4605 

394,483 

IRF4 

Interferon  regulatory  factor  4 

6 

rs7765767 

147,366,631 

STXBP5 

Syntaxin  binding  protein  5  (tomosyn) 

9 

rs7864499 

91,744,788 

IL6RL1 

Interleukin  6  receptor-like  1 

9 

rsl  0982093 

115,976,726 

COL27A1 

Collagen,  type  XXVII,  alpha  1 

12 

rsl  282801 6 

868,626 

WNK1 

WNK  lysine  deficient  protein  kinase  1 

18 

rs9959583 

20,993,778 

ZNF521 

Zinc  finger  protein  521 

18 

rs8085678 

20,998,828 

ZNF521 

Zinc  finger  protein  521 

12.  Statement  of  Plans  and  Milestones  for  the  Next  Quarter 

There  are  3  goals  that  will  be  the  focus  of  our  work  during  the  next  research  quarter. 

Goal  1.  Continue  to  recruit  new  T1D  affected  singletons  and  family  trios.  Milestone  1A.  File  application  for 
access  to  the  Wellcome  Trust  Case  Control  Consortium  for  access  to  the  T1D  (N=2,000)  and  control 
(N=3,000)  cohorts.  Milestone  1 B.  Continue  to  recruit  T 1 D  case  trios  and  singletons  from  Pittsburgh. 

Critical  to  the  success  of  the  project  to  identify  novel  loci  associated  with  T1  D  is  the  inclusion  of  as  many  T1  D 
affected  family  trios  and  singletons  as  possible.  To  accomplish  this  goal  we  will,  throughout  the  lifespan  of  the 
project,  continue  to  identify  genetic  datasets  that  are  available  for  incorporation  into  our  research  (Milestone 
1A)  and  recruit  new  participants  from  resources  available  through  our  association  with  UPMC  (Milestone  IB). 
We  anticipate  that  these  milestones,  or  very  similar  milestones,  will  be  the  first  goal  of  every  research  quarterly 
report. 

Goal  2.  Genotype  select  SNPs  using  T1D  affected  family  trios.  Milestone  2A.  Using  iPlex  genotyping 
methodology  genotype  SNPs  (N=12)  occurring  near  confirmed  T1 D  susceptibility  loci  (listed  in  Table  3)  in  T1  D 
affected  family  trios  (N=595)  to  measure  association  at  truly  positive  SNPs.  Milestone  2B.  Beginning  with  the 
SNPs  (N=17)  listed  in  Table  5  of  this  quarterly  report,  genotype  available  T1D  (N=595)  affected  family  trios 
using  iPlex  methodology  (Sequenom,  Inc.,  San  Diego,  CA). 

We  will  accomplish  Goal  2  by  formatting  DNA  samples  from  595  T1D  affected  family  trios  (comprised  of  DNA 
from  father,  mother,  and  affected  offspring)  into  384-well  trays.  Sequenom  iPlex  methodology  will  be  used  to 
genotype  the  SNPs  listed  in  Tables  3  and  5  for  confirmed  and  novel  T1D  susceptibility  loci,  respectively.  The 
iPlex  methodology  is  a  mass  spectrometer  based  approach  to  DNA  genotyping  and  the  method  is  available  at 
the  University  of  Pittsburgh  Genomics  Core  Laboratory.  In  the  event  that  select  SNPs  need  to  be  confirmed  by 
an  alternative  strategy,  we  will  employ  TaqMan  genotyping  methodology  that  is  available  in  the  Pi's  laboratory. 

Goal  3.  Analyze  the  genotyping  results  garnered  during  Goal  2  in  order  to  determine  which  SNPs  replicate 
signal  for  association  with  T1D  when  analyzed  in  affected  family  trios.  Milestone  3A.  Evaluate  genotyping 
data  from  Goal  2  for  quality  based  upon  SNP  missingness  occurring  less  than  5%  of  the  time,  sample 
missingness  occurring  less  than  5%  of  the  time,  and  consistency  with  known  family  pedigrees.  Milestone  3B. 
Perform  transmission  equilibrium  test  (TDT)  analysis  of  high  quality  genotyping  data  garnered  in  Goal  2  in 
order  to  identify  SNPs  that  replicated  in  the  GWA  and  TDT  analyses. 
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The  Goal  3  will  be  accomplished  using  the  experimental  data  garnered  during  completion  of  Goal  2.  Data  will 
be  initially  characterized  for  quality  by  determined  which,  if  any,  samples  or  SNP  genotyping  assays  result  in 
failure  frequency  exceeding  5%  of  the  total  number  of  assays  performed.  Quality  control  will  also  include  a  test 
for  family  pedigree  consistent  with  the  pattern  of  Mendelian  inheritance  between  parents  and  offspring. 
Datasets  that  are  consistent  with  parental  genotypes  and  quality  control  threshold  for  missingness  will  be  used 
to  determine  by  TDT  analysis  p-values  associated  with  T1 D  susceptibility. 

In  our  third  quarterly  scientific  progress  report  (03/01/08  -  05/31/08)  we  then  reported  the  following  findings: 


Type  1  Diabetes  Genome-Wide  Association  Study  Update: 

As  outlined  in  the  following  sections,  the  research  goals  for  the  second  quarter  of  2008  were  3-fold:  1)  continue 
to  recruit  new  T1D  affected  singletons  and  family  trios;  2)  genotype  select  SNPs  using  T1D  affected  family 
trios;  and  3)  analyze  the  genotyping  results  in  order  to  determined  which  SNPs  replicate  signal  for  association 
with  T1 D.  Our  progress  for  each  goal  is  addressed  below. 

Goal  1.  Continue  to  recruit  new  T1D  affected  singletons  and  family  trios.  Milestone  1A.  File  application  for 
access  to  the  Wellcome  Trust  Case  Control  Consortium  for  access  to  the  T1D  (N=2,000)  and  control 
(N=3,000)  cohorts.  Milestone  1 B.  Continue  to  recruit  T 1 D  case  trios  and  singletons  from  Pittsburgh. 

The  overall  goal  of  the  project  is  to  identify  novel  genetic  loci  that  impact  risk  of  developing  T 1 D.  Critical  to  the 
success  of  the  research  is  the  inclusion  of  as  many  T1  D  affected  singletons  and  family  trios  as  possible.  To 
accomplish  this  goal  we  will  throughout  the  lifespan  of  the  project  identify  publicly  available  genetic  datasets 
(Milestone  1A)  and  recruit  new  participants  (Milestone  IB)  that  can  be  incorporated  into  our  study.  Milestone 
1A  has  recently  been  completed.  On  June  9,  2008  we  submitted  our  application  to  the  Wellcome  Trust  Case 
Control  Consortium  (WTCCCC)  that,  if  approved,  will  allow  access  to  the  WTCCC  cohort  of  2,000  T1D  and 
3,000  non-TID  singletons.  This  collection  of  case  and  control  samples  has  been  genotyped  previously  using 
the  same  methodology  that  was  used  during  the  genome-wide  association  scan  (GWAS)  performed  in  our 
laboratory.  An  important  outcome  of  this  is  that  it  allows  the  preexisting  WTCCC  and  Pittsburgh  datasets  to  be 
combined  in  order  to  provide  a  powerful  resource  to  test  association  statistics  between  genetic  loci  and  T1D 
risk.  We  anticipate  that  during  the  next  research  quarter  we  will  be  notified  as  to  whether  our  application  to  the 
WTCCC  has  been  approved.  If  this  is  the  case  we  will  immediately  begin  to  compare  genetic  association 
signals  observed  in  our  data  with  that  observed  by  the  WTCCC. 

The  second  milestone  of  Goal  1  was  to  continue  to  recruit  T1 D  case  trios  and  singletons  from  Pittsburgh.  The 
updated  table  for  recruited  T1D  singletons  and  family  trios  is  illustrated  below  (Table  1).  In  addition  to 
participants  recruited  at  Pittsburgh,  we  have  arranged  for  DNA  samples  from  participants  recruited  at  Ulm, 
Germany  (N=440).  The  German  cohort  consists  of  T1D  (N=1 90)  and  non-TID  (N=250)  singletons.  The  non- 
TID  samples  add  significantly  to  our  study  in  that  they  represent  a  new  collection  of  control  samples  to  which 
the  genetics  of  T1D  case  samples  can  be  compared.  The  N=2,238  samples  listed  in  Table  1  are  available  as 
DNA  and  are  now  contained  within  the  Pittsburgh  repository. 


Table  1 .  T1 D  Singleton  (N=1 ,988),  non-TI  D  Singletons  (N=250),  and  Family  Trio  (N=595)  Characteristics. 


Pittsburgh  Cohort 

GoKinD  Cohort 

German  Cohort 

Singleton 

Trio 

Singleton 

Trio 

Singleton 

Singleton 

Demographic  Characteristics: 

£QD] 

mm 

mm 

mm 

mm 

(non-TI  D) 

Number  of  Participants 

55 

26 

1743 

569 

190 

250 

Caucasian  (%) 

100 

100 

100 

100 

100 

100 

Type  1  Diabetes  (%) 

100 

100 

100 

100 

100 

0 

Pittsburgh  samples  were  recruited  at  the  University  of  Pittsburgh  Medical  Center.  GoKinD  participants  were 
recruited  by  the  Genetics  of  Kidneys  in  Diabetes  Study.  The  German  cohort  was  recruited  from  Ulm,  Germany  as 
part  of  the  EMIL  cohort  study. 
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Goal  2.  Genotype  select  SNPs  using  T1D  affected  family  trios.  Milestone  2A.  Using  iPlex  genotyping 
methodology  genotype  SNPs  occurring  near  confirmed  T1D  susceptibility  loci  in  T1D  affected  family  trios 
(N=569)  in  order  to  measure  association  at  truly  positive  SNPs.  Milestone  2B.  Beginning  with  the  novel  and 
clustered  SNPs  listed  in  Table  2  of  this  quarterly  report,  genotype  available  T1D  (N=569)  affected  family  trios 
using  iPlex  methodology. 

Genetic  loci  were  identified  using  results  obtained  from  the  T1D  GWAS  and  are  listed  in  Table  2.  Loci  have 
been  sorted  by  whether  they  represent  unique  signals  for  T1D  risk  (i.e. ,  Cluster  of  Greater  Than  6  SNPs  and 
Novel  T1D  Candidates)  or  have  been  identified  previously  by  genetic  analysis  of  preexisting  T1D  cohorts  (i.e., 
T1  DBase  Candidate).  Among  the  loci  recognized  as  unique  signals  were  genes  encoding  the  insulin  signaling 
proteins  AKT2  and  IRS2.  Although  no  previously  published  genetic  screens  have  associated  genetic 
polymorphisms  of  these  loci  with  T1D  risk,  gene  knockout  and  knockdown  studies  performed  in  mice  have 
indicated  that  insulin  signaling  through  IRS2  is  critical  for  pancreatic  beta  cell  survival,  and  activity  of  the 
IRS2/AKT2  dependent  insulin  signaling  pathway  has  been  suggested  to  play  a  role  in  T1D  onset.  During  the 
next  research  period,  the  loci  listed  in  Table  2  will  continue  to  form  the  basis  for  choosing  which  SNPs  to  test 
by  iPlex  genotyping  technology. 

Goal  3.  Analyze  genotyping  results  garnered  during  Goal  2  in  order  to  determine  which  SNPs  replicate  signal 
for  association  with  T1  D  when  analyzed  in  affected  family  trios.  Milestone  3A.  Evaluate  genotyping  data  from 
Goal  2  for  quality  based  upon  SNP  missingness  occurring  less  than  5%  of  the  time,  sample  missingness 
occurring  less  than  5%  of  the  time,  and  consistency  with  known  family  pedigrees.  Milestone  3B.  Perform 
transmission  equilibrium  test  (TDT)  analysis  of  high  quality  genotyping  data  garnered  in  Goal  2  in  order  to 
identify  SNPs  that  replicated  in  the  GWA  and  TDT  analyses. 

In  order  to  complete  the  milestones  of  Goal  3  we  have  used  iPlex  genotyping  methodology  to  evaluate 
a  subset  of  the  loci  identified  during  analysis  of  the  T1D  GWAS  (Table  3).  The  iPlex  methodology  is  a  mass 
spectrometer  based  approach  to  DNA  genotyping.  The  method  is  available  at  the  University  of  Pittsburgh 
Genomics  Core  Laboratory.  The  iPlex  generated  genotyping  results  have  been  obtained  for  10  loci  (Table  3) 
on  DNA  samples  from  569  T1D  affected  family  trios.  As  our  work  into  elucidating  the  genetics  of  T1D 
susceptibility  progresses  we  will  continue  to  evaluate  genetic  loci  for  association  with  the  T1D  phenotype. 

Using  the  criteria  stated  in  Milestones  3A  and  3B  of  Goal  3  we  evaluated  potential  risk  loci  for  genetic 
association  with  T1D.  The  first  set  of  genotyping  data  has  been  characterized  for  quality  of  the  genotyping 
results.  Quality  control  analyses  showed  that  the  first  15  SNPs  tested  exhibited  greater  than  95%  success  for 
providing  genotype  data  and  that  greater  than  95%  of  the  DNA  samples  provided  genotyping  data  for  each 
SNP  (data  not  shown).  Quality  control  testing  also  included  the  requirement  that  family  pedigrees  were 
consistent  with  the  pattern  of  Mendelian  inheritance  of  genetic  markers  between  parents  and  offspring.  It  was 
observed  that  all  T1D  family  trios  passed  the  pedigree  test.  High  quality  genotyping  results  were  also 
examined  for  accuracy  and  reproducibility  by  comparing  genotypes  determined  using  TaqMan  and  iPlex  (Table 
4).  Quality  control  analysis  of  the  iPlex  data  indicated  that  genotyping  results  were  highly  reproducible  when 
compared  with  the  identical  samples  genotyped  using  TaqMan  assays.  Taken  together,  these  analyses 
indicate  that  SNP  typing  assays,  DNA  samples,  and  the  iPlex  methodology  were  of  sufficient  quality  to 
accomplish  the  goal  of  association  testing  for  all  interesting  genetic  markers. 

Summarized  in  Table  5  are  the  results  for  genetic  association  testing  of  1 5  SNPs  with  the  T1 D  phenotype.  The 
assay  used  T1D  affected  family  trios  (N=569)  and  was  evaluated  using  the  Transmission  Disequilibrium  Test 
(TDT).  The  results  indicate  that  this  first  set  of  SNPs  did  not  replicate  the  results  observed  during  the  T1D 
GWAS  of  case  and  control  singletons.  A  likely  cause  of  this  discrepancy  is  that  the  GWAS  data  was 

confounded  by  population  stratification  of  T1D  (case)  and  non-TID  (control)  participants,  a  common 

occurrence  in  case  and  control  analyses.  The  TDT  test  is  notable  for  controlling  errors  that  correlate  with 
population  stratification.  In  fact,  the  overall  study  design  for  our  research  project  anticipated  this  possibility 

when  we  chose  to  follow  up  on  the  results  of  the  case  and  control  T1D  GWAS  with  replication  using  T1D 

affected  family  trios  and  TDT  analysis.  TDT  analysis  of  additional  loci  (Table  6)  will  occur  during  the  next 
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research  period  and  will  confirm  whether  these  other  signals,  observed  as  significant  during  the  T1D  GWAS, 
were  due  to  population  stratification  or  reflect  true  positive  signals  for  genetic  association. 


Table  2.  High  value  loci  identified  by  genome-wide  assocation  scan  for  T1 D. 


Locus 

Classification 

Locus 

Classification 

ADNP2 

Cluster  of  >  6  SNPs 

CERKL 

T1  DBase  Candidate 

CFDP1 

Cluster  of  >  6  SNPs 

CLEC16A 

T1  DBase  Candidate 

KHDRBS2 

Cluster  of  >  6  SNPs 

CNTN4 

T1  DBase  Candidate 

LOC6461 12 

Cluster  of  >  6  SNPs 

CTLA4 

T1  DBase  Candidate 

SEMA6D 

Cluster  of  >  6  SNPs 

CYBA 

T1  DBase  Candidate 

TWISTNB 

Cluster  of  >  6  SNPs 

DBP 

T1  DBase  Candidate 

AKT2 

Novel  T1D  Candidate 

DEPDC2 

T1  DBase  Candidate 

ALOX5AP 

Novel  T1D  Candidate 

DIAPH3 

T1  DBase  Candidate 

AMPD1 

Novel  T1D  Candidate 

ELF1 

T1  DBase  Candidate 

APBA2 

Novel  T1D  Candidate 

ERBB3 

T1  DBase  Candidate 

BMPR1A 

Novel  T1D  Candidate 

FADD 

T1  DBase  Candidate 

C1GALT1 

Novel  T1D  Candidate 

FBXL4 

T1  DBase  Candidate 

CCK 

Novel  T1D  Candidate 

IL2RA 

T1  DBase  Candidate 

COL27A1 

Novel  T1D  Candidate 

IL6 

T1  DBase  Candidate 

CREG1 

Novel  T1D  Candidate 

IRF8 

T1  DBase  Candidate 

GOPC 

Novel  T1D  Candidate 

ITPR3 

T1  DBase  Candidate 

IL6RL1 

Novel  T1D  Candidate 

LOC728091 

T1  DBase  Candidate 

IL6ST 

Novel  T1D  Candidate 

LPL 

T1  DBase  Candidate 

IRF4 

Novel  T1D  Candidate 

MAPK14 

T1  DBase  Candidate 

IRS2 

Novel  T1D  Candidate 

MED12L 

T1  DBase  Candidate 

KCNH5 

Novel  T1D  Candidate 

OPCML 

T1  DBase  Candidate 

KCNT2 

Novel  T1D  Candidate 

PALLD 

T1  DBase  Candidate 

KCTD18 

Novel  T1D  Candidate 

PAX4 

T1  DBase  Candidate 

MGST3 

Novel  T1D  Candidate 

PDX1 

T1  DBase  Candidate 

PGR 

Novel  T1D  Candidate 

PHTF1 

T1  DBase  Candidate 

RXFP2 

Novel  T1D  Candidate 

PKD1L1 

T1  DBase  Candidate 

RYR3 

Novel  T1D  Candidate 

POLR2B 

T1  DBase  Candidate 

SLC30A8 

Novel  T1D  Candidate 

SENP7 

T1  DBase  Candidate 

STXBP5 

Novel  T1D  Candidate 

SEZ6L 

T1  DBase  Candidate 

TPK1 

Novel  T1D  Candidate 

SLC7A1 

T1  DBase  Candidate 

TRAF3IP1 

Novel  T1D  Candidate 

SLC03A1 

T1  DBase  Candidate 

WNK1 

Novel  T1D  Candidate 

SOS2 

T1  DBase  Candidate 

ZNF521 

Novel  T1D  Candidate 

SPG7 

T1  DBase  Candidate 

AIRE 

T1  DBase  Candidate 

STAT4 

T1  DBase  Candidate 

ANKRD15 

T1  DBase  Candidate 

TLR2 

T1  DBase  Candidate 

AR 

T1  DBase  Candidate 

TLR7 

T1  DBase  Candidate 

BARD1 

T1  DBase  Candidate 

TMEM23 

T1  DBase  Candidate 

C4orf31 

T1  DBase  Candidate 

TNFSF11 

T1  DBase  Candidate 

C9orf25 

T1  DBase  Candidate 

TRAFD1 

T1  DBase  Candidate 

CAT 

T1  DBase  Candidate 

ULBP3 

T1  DBase  Candidate 

CD40 

T1  DBase  Candidate 

VDR 

T1  DBase  Candidate 

CD48 

T1  DBase  Candidate 
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Table  3.  Novel  T1D  genetic  loci  tested  using  iPlex  genotyping. 

Locus  Description 

AMPD1  Adenosine  monophosphate  deaminase  1  (isoform  M) 

COL27A1  Collagen,  type  XXVII,  alpha  1 

IL6RL1  Interleukin  6  receptor-like  1 

IL6ST  Interleukin  6  signal  transducer  (gp130,  oncostatin  M  receptor) 

IRF4  Interferon  regulatory  factor  4 

KCNH7  Potassium  voltage-gated  channel,  subfamily  H  (eag-related),  member  7 
MGST3  Microsomal  glutathione  S-transferase  3 

STXBP5  Syntaxin  binding  protein  5  (tomosyn) 

WNK1  WNK  lysine  deficient  protein  kinase  1 

ZNF521  Zinc  finger  protein  521 


Table  4.  Quality  control  analysis  for  iPlex  and  TaqMan  genotyping  of  rs8078936  and  rs222747 


Genotype  Based  Typing: 

iPlex  Genotyping 

TaqMan  Genotyping 

Chi  Square 

dbSNP  ID 

AA 

AB 

BB 

AA 

AB  BB 

p-value 

rs8078936 

394 

1423 

1252 

391 

1433  1252 

0.88 

rs222747 

182 

1170 

1735 

179 

1151  1732 

0.78 

Allele  Based  Typing: 

iPlex  Genotyping 

TaqMan  Genotyping 

dbSNP  ID 

Allele  A 

Allele  B 

Allele  A 

Allele  B 

p-value 

rs8078936 

2211 

3927 

2215 

3937 

0.98 

rs222747 

1534 

4640 

1509 

4615 

0.79 

p-values  were  calculated  using  chi  square  analysis  and  1  degree  of  freedom. 


Table  5.  TDT  results  from  T1 D  affected  family  trios  (N=569) 

Transmission  Disequilibrium  Test  Data 


dbSNP  ID 

Chr 

Location 

Locus 

Allele  1 

Allele  2 

Total 

p-value 

rsl  0858046 

1 

114,875,303 

AMPD1 

239 

235 

474 

0.85 

rsl  091 791 3 

1 

162,343,920 

MGST3 

283 

264 

547 

0.42 

rs7555369 

1 

162,344,746 

MGST3 

263 

281 

544 

0.44 

rsl  2470503 

2 

163,405,297 

KCNH7 

236 

257 

493 

0.34 

rsl  001 911 

2 

163,421,846 

KCNH7 

255 

225 

480 

0.17 

rsl  492990 

2 

163,422,123 

KCNH7 

254 

226 

480 

0.20 

rsl  0203602 

2 

163,442,376 

KCNH7 

197 

204 

401 

0.73 

rsl  364562 

2 

163,449,972 

KCNH7 

199 

205 

404 

0.77 

rsl  0042443 

5 

55,388,527 

IL6ST 

226 

238 

464 

0.58 

rsl  321 4605 

6 

394,483 

IRF4 

175 

183 

358 

0.67 

rs7765767 

6 

147,366,631 

STXBP5 

248 

267 

515 

0.40 

rs7864499 

9 

91,744,788 

IL6RL1 

225 

213 

438 

0.57 

rsl  0982093 

9 

115,976,726 

COL27A1 

112 

112 

224 

1.00 

rsl  282801 6 

12 

868,626 

WNK1 

238 

248 

486 

0.65 

rs8085678 

18 

20,998,828 

ZNF521 

272 

265 

537 

0.76 

16 


Table  6.  High  priority  SNPs  for  TDT  analysis  during  the  upcoming  research  quarter. 


dbSNP  ID 

Chr 

Location 

Locus 

dbSNP  ID 

Chr 

Location 

Locus 

rs6689415 

1 

114,908,992 

AMPD1 

rsl  0825557 

10 

51,709,687 

TMEM23 

rs2268697 

1 

115,029,016 

AMPD1 

rs2803590 

10 

88,362,973 

BMPR1A 

rs6427093 

1 

165,823,621 

CREG1 

rsl  1032755 

11 

34,533,161 

CAT 

rs2094028 

1 

194,417,119 

KCNT2 

rs574434 

11 

100,384,527 

PGR 

rsl  441 1 47 

2 

182,220,101 

CERKL 

rs7305099 

12 

845,537 

WNK1 

rs2696344 

2 

182,241,456 

CERKL 

rs7484827 

12 

46,409,830 

VDR 

rs2028201 

2 

201,011,255 

KCTD18 

rsl  1701 88 

13 

41,576,971 

TNFSF11 

rs523437 

2 

239,063,548 

TRAF3IP1 

rsl  1701 87 

13 

41,577,835 

TNFSF11 

rs580209 

2 

239,065,169 

TRAF3IP1 

rs670676 

13 

41,599,739 

TNFSF11 

rs98771 18 

3 

2,578,450 

CNTN4 

rsl  041 637 

13 

109,307,346 

IRS2 

rs6771904 

3 

152,630,421 

MED12L 

rs71 55762 

14 

49,487,928 

SOS2 

rs7654918 

4 

58,045,356 

POLR2B 

rs71 55791 

14 

49,487,980 

SOS2 

rs3431 73 

4 

121,980,468 

C4orf31 

rs2676083 

15 

31,397,903 

RYR3 

rs950286 

6 

374,457 

IRF4 

rsl  6959754 

15 

45,680,428 

SEMA6D 

rs2498587 

6 

118,133,432 

GOPC 

rsl  0781 976 

16 

73,872,339 

CFDP1 

rs6920096 

6 

147,412,791 

STXBP5 

rsl  559339 

16 

73,876,147 

CFDP1 

rs6920901 

6 

147,470,991 

STXBP5 

rs4888405 

16 

73,985,697 

CFDP1 

rs6922684 

6 

150,491,727 

ULBP3 

rsl  1076678 

16 

87,165,397 

CYBA 

rs4870524 

6 

150,820,890 

ULBP3 

rs71 84960 

16 

88,489,162 

SPG7 

rsl  294610 

7 

7,332,182 

C1GALT1 

rsl  971 613 

18 

21,589,323 

ZNF521 

rsl  404829 

7 

144,255,618 

TPK1 

rsl  064059 

18 

75,994,674 

ADNP2 

rs2380437 

8 

69,156,370 

DEPDC2 

rs3786803 

19 

35,655,444 

LOC728091 

rs7032335 

9 

34,572,883 

C9orf25 

rs687136 

19 

45,366,831 

AKT2 

rs2808784 

9 

115,961,345 

COL27A1 

rs687155 

19 

45,366,846 

AKT2 

rs2567724 

9 

115,961,754 

COL27A1 

rs281408 

19 

53,925,218 

DBP 

rsl  2975781 

19 

53,941,510 

DBP 

12.  Statement  of  Plans  and  Milestones  for  the  Next  Quarter 

There  are  3  goals  that  will  be  the  focus  of  our  work  during  the  next  research  quarter. 

Goal  1.  Continue  to  recruit  new  T1  D  affected  singletons  and  family  trios.  Milestone  1  A.  As  a  continuing  goal 
of  the  project  into  the  genetics  of  T1 D  we  will  pursue  the  recruitment  of  new  T 1 D  case  trios  and  singletons  from 
Pittsburgh.  Milestone  IB.  Continue  to  arrange  new  collaborations  with  researchers  at  other  institutions  (e.g., 
Wellcome  Trust  Case  Control  Consortium  and  EMIL  researchers  at  Ulm,  Germany)  to  gain  increased  access 
to  DNA  repositories  and  datasets  useful  for  evaluating  the  genetics  of  T1 D 

Goal  2.  Continue  to  genotype  select  SNPs  using  T1D  affected  family  trios.  Milestone  2A.  Select  loci  from 
those  listed  in  Table  6  for  use  during  genotyping  replication  studies.  Milestone  2B.  Using  the  T1D  affected 
family  trios  determine  the  genotype  of  each  SNP  chosen  during  Milestone  2A. 

Goal  3.  Analyze  the  genotyping  results  garnered  during  Goal  2  in  order  to  determine  which  SNPs  replicate 
signal  for  association  with  T1  D  when  analyzed  using  affected  family  trios.  Milestone  3A.  Analyze  the  results 
from  Goal  2  by  Transmission  Disequilibrium  Testing  to  identify  loci  that  reproduce  the  results  observed  during 
the  T1  D  GWAS  study. 
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In  the  fourth  and  final  quarterly  scientific  progress  report  (06/01/08  -  08/26/08)  of  year  01,  we  now  report  on  our 
cumulative  results. 

Type  1  Diabetes  Genome-Wide  Association  Study  Update: 

As  outlined  in  the  following  sections,  the  research  goals  for  the  third  quarter  of  2008  were  3-fold:  1)  continue  to  recruit  new 
Type  1  Diabetes  (T1D)  affected  singletons  and  family  trios;  2)  continue  to  genotype  select  SNPs  using  T1D  affected 
family  trios;  and  3)  analyze  the  genotyping  results  garnered  during  Goal  2  in  order  to  determine  which  SNPs  replicate 
signal  for  association  with  T1 D  when  analyzed  using  affected  family  trios.  Our  progress  for  each  goal  is  addressed  below. 

Goal  1.  Continue  to  recruit  new  T1D  affected  singletons  and  family  trios.  Milestone  1A.  As  a  continuing  goal  of  the 
project  into  the  genetics  of  T1D  we  will  pursue  the  recruitment  of  new  T1D  case  trios  and  singletons  from  Pittsburgh. 
Milestone  IB.  Continue  to  arrange  new  collaborations  with  researchers  at  other  institutions  (e.g.,  Wellcome  Trust  Case 
Control  Consortium  and  EMIL  researchers  at  Ulm,  Germany)  to  gain  increased  access  to  DNA  repositories  and  datasets 
useful  for  evaluating  the  genetics  of  T1 D. 

The  first  and  second  milestones  of  Goal  1  were  to  continue  to  recruit  T1  D  case  trios  and  singletons  from  Pittsburgh  and  to 
actively  pursue  collaborations  with  other  research  centers  to  increase  the  samples  available  to  our  study.  The  updated 
table  for  recruited  T1D  singletons  and  family  trios  is  illustrated  below  indicating  that  during  the  last  research  quarter  an 
additional  3  participants  were  recruited  in  Pittsburgh  (Table  1).  In  contrast,  participants  recruited  at  Ulm,  Germany  have 
increased  substantially  from  N=440  reported  previously  to  N=2,608  during  the  last  research  quarter.  The  German  cohort 
now  consists  of  T1 D  (N=858)  and  non-TI  D  (N=1 ,750)  singletons.  The  non-TI  D  samples  add  significantly  to  our  study  in 
that  they  represent  a  collection  of  control  samples  to  which  the  genetics  of  T1D  case  samples  can  be  compared.  As  of 
the  end  of  the  present  research  quarter,  the  N=4,978  samples  listed  in  Table  1  are  available  as  DNA  and  are  now 
contained  within  the  Pittsburgh  repository. 

Milestones  1 A  and  1 B  of  Goal  1  represent  goals  that  will  be  pursued  continuously  throughout  the  project.  For  the  recently 
completed  research  quarter  the  milestones  were  accomplished  upon  recruitment  of  additional  participants  from  Pittsburgh 
and,  more  significantly,  upon  receipt  of  greater  than  2,000  DNA  samples  from  our  collaborator  in  Germany. 

Table  1 .  T1 D  Singleton  (N=2,659),  non-TI  D  Singletons  (N=1 ,750),  and  T1 D  Family  Trio  (N=595)  Characteristics. 


Pittsburgh  Cohort 

GoKinD  Cohort 

German  Cohort 

Singleton 

Trio 

Singleton 

Trio 

Singleton 

Singleton 

Demographic  Characteristics: 

mm 

mm 

mm 

mm 

mm 

(non-TI  D) 

Number  of  Participants 

58 

26 

1,743 

569 

858 

1,750 

Caucasian  (%) 

100 

100 

100 

100 

100 

100 

Type  1  Diabetes  (%) 

100 

100 

100 

100 

100 

0 

Pittsburgh  samples  were  recruited  at  the  University  of  Pittsburgh  Medical  Center.  GoKinD  participants 
were  recruited  by  the  Genetics  of  Kidneys  in  Diabetes  Study.  The  German  cohort  was  recruited  from 
Ulm,  Germany  as  part  of  the  EMIL  cohort  study. 


Goal  2.  Continue  to  genotype  select  SNPs  using  T1 D  affected  family  trios.  Milestone  2A.  Select  loci  from  those  listed  in 
Table  6  (of  the  previous  quarterly  report)  for  use  during  genotyping  replication  studies.  Milestone  2B.  Using  the  T1D 
affected  family  trios  determine  the  genotype  of  each  SNP  chosen  during  Milestone  2A. 

As  indicated  in  Table  2,  select  SNPs  were  chosen  for  analysis  using  mass  spectrometry  based  genotyping  method 
developed  by  Sequenom,  Inc  (San  Diego,  CA).  SNPs  were  chosen  based  upon  the  signal  observed  during  the  initial 
genome-wide  association  (GWA)  scan  for  T1D  that  has  been  described  in  preceding  quarterly  reports.  In  total,  21  SNPs 
were  genotyped  using  the  Sequenom  iPlex  method  and  high  quality  data  was  returned  on  19,  providing  an  overall  return 
of  high  quality  data  from  SNPs  sent  for  analysis  of  90%.  The  2  SNPs  that  failed  to  provide  genotyping  data  were 
rsl  6839682  ( KCNT2 )  and  rs9959583  ( ZNF521 )  and  were  due  to  lack  of  signal  indicating  that  these  samples  may  have 
failed  to  amplify  during  sample  preparation.  Subsequent  analysis  of  the  genotyping  data  obtained  from  the  19 
successfully  genotyped  markers  indicated  that  family  pedigree  remained  intact  and  that  missing  data  due  to  failure  to 
genotype  remained  below  5%  (data  not  shown). 

Successful  completion  of  Milestones  2A  and  2B  for  the  current  research  quarter  was  achieved  upon  creation  of  the 
genotyping  data  for  these  SNPs  (Milestone  2A)  and  quality  control  analysis  for  missingness  and  family  pedigree 
(Milestone  2B). 
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Table  2.  List  of  SNPs  for  analysis  of  association  with  T 1 D. 


Candidate 

Cytogenetic 

dbSNP  ID 

Chr 

Location 

Locus 

Band 

Distance  (kb)1 

rsl  0858046 

1 

114,875,303 

AMPD1 

1  pi  3 

141.9 

rsl  091 791 3 

1 

162,343,920 

MG  ST 3 

1q23 

1,523.2 

rs7555369 

1 

162,344,746 

MG  ST 3 

1q23 

1,522.3 

rsl  2470503 

2 

163,405,397 

KCNH7 

2q24.2 

1.9 

rsl  001 911 

2 

163,421,846 

KCNH7 

2q24.2 

18.4 

rsl  492990 

2 

163,422,123 

KCNH7 

2q24.2 

18.6 

rsl  0203602 

2 

163,442,376 

KCNH7 

2q24.2 

38.9 

rsl  364562 

2 

163,449,972 

KCNH7 

2q24.2 

46.5 

rsl  0042443 

5 

55,388,527 

IL6ST 

5q1 1 

62.0 

rsl  321 4605 

6 

394,483 

IRF4 

6p25-p23 

38.3 

rs7765767 

6 

147,366,631 

STXBP5 

6q24.3 

199.9 

rs7864499 

9 

91,744,788 

IL6RL1 

9q22.2 

138.5 

rsl  0982093 

9 

115,976,726 

COL27A1 

9q32 

intron 

rsl  282801 6 

12 

868,626 

WNK1 

1 2p1 3.3 

nonsynonymous 

rs87761 0 

17 

3,422,240 

TRPV1 

1 7p1 3.3 

nonsynonymous 

rs8078936 

17 

3,427,145 

TRPV1 

1 7p1 3.3 

intron 

rsl  61 393 

17 

3,436,263 

TRPV1 

1 7p1 3.3 

intron 

rs222747 

17 

3,439,949 

TRPV1 

1 7p1 3.3 

nonsynonymous 

rs8085678 

18 

20,998,828 

ZNF521 

1 8q1 1 .2 

intron 

1 .  Distance  between  SNP  and  candidate  locus  if  extragenic.  If  SNP  is  intragenic  than  its  funtional 
position  is  indicated. 


Goal  3.  Analyze  the  genotyping  results  garnered  during  Goal  2  in  order  to  determine  which  SNPs  replicate  signal  for 
association  with  T1D  when  analyzed  using  affected  family  trios.  Milestone  3A.  Analyze  the  results  from  Goal  2  by 
Transmission  Disequilibrium  Testing  to  identify  loci  that  reproduce  the  results  observed  during  the  T1 D  GWAS  study. 

Transmission  disequilibrium  testing  (TDT)  of  the  genotype  data  obtained  during  completion  of  Goal  2  is  the  focus  of  the 
third  goal  for  the  current  research  quarter.  The  goal  was  completed  by  calculating  the  p-value  of  association  correlated 
with  the  19  SNPs  listed  in  Table  3  in  order  to  estimate  the  risk  of  developing  T1D.  The  analysis  was  based  upon 
genotyping  data  obtained  from  569  family  trios  in  which  the  offspring  was  T1 D  positive.  Of  the  19  SNPs  examined  signals 
for  inheritance  of  one  allele  over  the  other,  and  thus  an  indication  that  alleles  associate  with  T1 D,  varied  between  50%  (or 
essentially  random)  for  locus  COL27A1  and  53.7%  for  SNP  rs161393  located  within  the  TRPV1  locus.  Analysis  of  the 
genotyping  data  for  rs161393  indicated  that  out  of  486  heterozygous  parents  there  were  225  offspring  inheriting  allele  A 
and  261  that  inherited  allele  B.  This  corresponded  to  a  chi  square  value  of  2.7  and  p-value  of  0.1.  However,  the  criteria 
of  our  study  has  been  that  for  SNPs  to  be  considered  as  exhibiting  significant  association  with  T1 D  the  p-values  must  be 
less  than  0.05.  Thus,  for  the  SNPs  examined  so  far  for  association  with  T1 D  the  conclusion  is  that  they  are  not  likely  to 
significantly  impact  disease  risk. 

Completion  of  Goal  3  was  accomplished  by  analysis  of  the  genotyping  data  obtained  during  the  recently  completed 
research  quarter.  The  results  failed  to  identify  novel  T1D  loci  but  the  analysis  did  confirm  that  the  genotyping  method  and 
laboratory  procedures  in  place  in  Pittsburgh  are  sufficient  to  routinely  obtain  high  quality  genetic  information  from  at  large 
number  of  samples  (N=1,707)  and  on  a  large  number  of  genetic  markers  (N=19).  The  operational  infrastructure  that  has 
been  developed  to  accomplish  the  goals  of  the  2007  and  2008  research  period  will  be  used  to  investigate  genetic 
influence  of  diabetes  complications,  while  continuing  to  investigate  candidate  markers  for  T1D,  in  the  upcoming  research 
period(s). 


Table  3.  Transmission  disequilibrium  analysis  of  T1 D  affected  family  trios. 


dbSNP  ID 

Chr 

Location 

Locus 

Allele  1 

Allele  2 

Total 

%Trans  Chi  Sa  p-value 

rsl  0858046 

1 

114,875,303 

AMPD1 

239 

235 

474 

50.4 

0.0 

8.5E-01 

rsl  091 791 3 

1 

162,343,920 

MG  ST 3 

283 

264 

547 

51.7 

0.7 

4.2E-01 

19 


rs7555369 

1 

162,344,746 

MG  ST 3 

263 

281 

544 

48.3 

0.6 

4.4E-01 

rsl  2470503 

2 

163,405,397 

KCNH7 

236 

257 

493 

47.9 

0.9 

3.4E-01 

rsl  001 91 1 

2 

163,421,846 

KCNH7 

255 

225 

480 

53.1 

1.9 

1.7E-01 

rsl  492990 

2 

163,422,123 

KCNH7 

254 

226 

480 

52.9 

1.6 

2.0E-01 

rsl  0203602 

2 

163,442,376 

KCNH7 

197 

204 

401 

49.1 

0.1 

7.3E-01 

rsl  364562 

2 

163,449,972 

KCNH7 

199 

205 

404 

49.3 

0.1 

7.7E-01 

rsl  0042443 

5 

55,388,527 

IL6ST 

226 

238 

464 

48.7 

0.3 

5.8E-01 

rsl  321 4605 

6 

394,483 

IRF4 

175 

183 

358 

48.9 

0.2 

6.7E-01 

rs7765767 

6 

147,366,631 

STXBP5 

248 

267 

515 

48.2 

0.7 

4.0E-01 

rs7864499 

9 

91,744,788 

IL6RL1 

225 

213 

438 

51.4 

0.3 

5.7E-01 

rsl  0982093 

9 

115,976,726 

COL27A1 

112 

112 

224 

50.0 

0.0 

1.0E+00 

rsl  282801 6 

12 

868,626 

WNK1 

238 

248 

486 

49.0 

0.2 

6.5E-01 

rs87761 0 

17 

3,422,240 

TRPV1 

45 

47 

92 

48.9 

0.0 

8.3E-01 

rs8078936 

17 

3,427,145 

TRPV1 

227 

259 

486 

46.7 

2.1 

1.5E-01 

rsl  61 393 

17 

3,436,263 

TRPV1 

225 

261 

486 

46.3 

2.7 

1.0E-01 

rs222747 

17 

3,439,949 

TRPV1 

197 

208 

405 

48.6 

0.3 

5.8E-01 

rs8085678 

18 

20,998,828 

ZNF521 

272 

265 

537 

50.7 

0.1 

7.6E-01 

Statement  of  Plans  for  the  First  Quarterly  Report  of  the  2008  to  2009  research  period 

The  goals  listed  below  based  upon  Task  2  (recruit  participants  with  T1D  or  T1DN)  and  Task  3  (genotype  candidate 
genetic  markers  for  association  with  T1DN)  of  the  overall  research  project  and  will  use  our  available  cohort  of  T1DN 
(cases)  and  T1D  (controls)  to  perform  an  additional  series  of  genetic  analyses  based  upon  the  GWA  study  that  used  the 
500,000  SNP  typing  microarray  previously  reported.  The  overall  goal  will  be  to  confirm  the  strongest  genetic  signals 
associated  with  diabetes  complications  and  will  focus  upon  T1 D  nephropathy  (T1 DN). 

There  are  3  goals  that  will  be  the  focus  of  our  work  during  the  next  research  quarter. 

Goal  1.  Expand  genetic  analysis  to  include  (T1DN)  phenotype.  Milestone  1A.  Analyze  available  anthropometric  data 
linked  with  T1  D  cohort  for  presence  of  diabetes  complications.  Milestone  IB.  Determine  the  size  of  the  available  cohort 
for  T1 D  nephropathy  (T1  DN)  and  T 1 D  with  healthy  kidney  function. 

Goal  2.  Garner  genetic  data  to  test  for  association  of  select  SNPs  with  T1DN.  Milestone  2A.  Use  the  T1DN  and  T1D 
cohort  identified  during  Goal  1  to  initiate  genotyping  of  candidate  SNPs  for  their  association  with  T1  DN. 

Goal  3.  Analyze  the  results  of  genotyping  obtained  using  the  T1DN  cohort.  Milestone  3A.  Using  T1DN  family  trios 
calculate  the  p-value  for  association  of  candidate  SNPs  with  the  T1  DN  phenotype.  Milestone  3B.  Using  T1  DN  and  T1 D 
singletons  calculate  the  p-value  for  association  of  select  SNPs  with  the  T1  DN  phenotype. 

KEY  RESEARCH  ACCOMPLISHMENTS: 

1 .  Creation  of  a  sample  repository  containing  DNA  from  3,1 70  T1 D  and  1 ,750  non-TI  D  participants 

2.  Establishment  of  laboratory  methods  for  multiplex  genotyping  of  30  or  more  SNPs  simultaneously  from  as  many  as 
5,000  samples 

3.  Initiation  of  genetic  studies  to  evaluate  candidate  genetic  markers  for  association  with  diabetes  complications 

4.  Analysis  of  genetic  data  to  evaluate  candidate  genes  for  association  with  diabetes  phenotypes 

5.  Publication  of  9  manuscripts 
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REPORTABLE  OUTCOMES: 


Manuscripts  (9  publications) 

1 .  Ringquist,  S.,  Nichol,  L.,  Trucco,  M.  Transplantation  Genetics.  (2007)  In:  Emery  and  Rimoin's  Principles  and  Practice  of 
Medical  Genetics,  eds.  D.L.  Rimoin,  M.  Conner,  R.E.  Pyeritz,  B.R.  Korf,  and  A.E.  Emery  5th  Edition,  Elsevier  Books, 
Oxford. 

2.  Ringquist,  S.,  Pecoraro,  C.,  Trucco,  M.  (2007)  Web-based  program  for  pyrosequencing  primer  design.  ASHI  Quarterly 
31:50-52. 

3.  Ringquist  S,  Pecoraro  C,  Lu  Y,  Styche  A,  Rudert  WA,  Benos  PV,  Trucco  M.  (2007)  Web-based  primer  design  software 
for  genome-scale  genotyping  by  pyrosequencing.  Methods  Mol  Biol  373:25-38. 

4.  Ringquist  S,  Styche  A,  Rudert  WA,  Trucco  M.  (2007)  Pyrosequencing-based  strategies  for  improved  allele  typing  of 
human  leukocyte  antigen  loci.  Methods  Mol  Biol  373:1 15-134. 

5.  Pasquali  L,  Bedeir  A,  Ringquist  S,  Styche  A,  Bhargava  R,  Trucco  G.  (2007)  Quantification  of  CpG  island  methylation  in 
progressive  breast  lesions  from  normal  to  invasive  carcinoma.  Cancer  Lett  257:136-144. 

6.  Pasquali  L,  Trucco  M,  Ringquist  S.  (2007)  Navigating  pathways  affecting  type  1  diabetic  kidney  disease.  Pediatr 
Diabetes  8:307-322. 

7.  Luca  D,  Ringquist  S,  Klei  L,  Lee  AB,  Gieger  C,  Wichmann  HE,  Schreiber  S,  Krawczak  M,  Lu  Y,  Styche  A,  Devlin  B, 
Roeder  K,  Trucco  M.  (2008)  On  the  use  of  general  control  samples  for  genome-wide  association  studies:  genetic 
matching  highlights  causal  variants.  Am  J  Hum  Genet  82:453-463. 

8.  Zhang  L,  Perdomo  G,  Kim  DH,  Qu  S,  Ringquist  S,  Trucco  M,  Dong  HH.  (2008)  Proteomic  analysis  of  fructose-induced 
fatty  liver  in  hamsters.  Metabolism  57:1 1 1 5-1 1 24. 

9.  Lu,  L.,  Boehm,  J.,  Nichol,  L.,  Trucco,  M.,  and  Ringquist,  S.  (2008)  Multiplex  HLA  typing  by  pyrosequencing.  Methods  in 
Molecular  Biology  (in  press). 

Abstracts 

None 
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CONCLUSION: 

The  principal  conclusions  for  the  first  year  of  funding  are  that:  1)  we  have  been  successful  in  recruiting  and  obtained  via 
collaboration  a  large  cohort  (nearly  5,000  participants)  for  analysis  of  diabetes  complications  and  2)  that  laboratory 
methods  are  in  place  and  have  been  tested  for  analysis  of  large  number  of  samples  for  association  between  SNP  and 
T1 D  related  phenotypes.  Our  work  during  the  first  year  primarily  focused  on  increasing  the  cohort  size  and  on  validating 
laboratory  methods  for  genetic  analysis.  Now  that  these  have  been  accomplished  we  will  begin  to  test  our  principle 
hypothesis  that  risk  for  diabetes  complications  is  dependent  upon  genes  and  that  environmental  factors  such  as  smoking 
will  influence  that  risk. 

The  work  generated  thus  far  has  resulted  in  9  publications  that  are  listed  under  the  section  entitled  "REPORTABLE 
OUTCOMES".  These  scientific  manuscripts  have  been  published  in  peer  review  publications.  We  are  currently  editing  a 
new  manuscript  describing  our  work  on  diabetes  complications  and  anticipate  having  that  paper  ready  for  submission  to  a 
scientific  journal  prior  to  the  end  of  2008. 

The  So  What  Section.  What  are  the  implications  of  this  research?  Diabetes  affects  16  million  Americans  and  800,000 
new  cases  annually.  African,  Hispanic,  Native  and  Asian  Americans  are  particularly  susceptible  to  its  most  severe 
complications.  Costs  associated  with  diabetes  may  be  as  high  as  $132  billion.  Diabetes  accounts  for  42%  of  new  cases 
of  ESRD  with  over  1 00,000  cases  per  year  at  an  average  cost  of  $55,000  per  patient  annually. 

What  are  the  military  significance  and  public  purpose  of  this  research?  As  the  military  is  a  reflection  of  the  US  population, 
improved  prediction  of  risk  for  developing  diabetes  and  diabetic  complications  among  active  duty  members  of  the  military, 
their  families,  and  retired  military  personnel  will  potentially  allow  focused  preventative  treatment  of  at-risk  individuals, 
providing  significant  healthcare  savings  and  improved  patient  well  being. 
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T1D  GWAS  p-values  <0.001  (Revised  122807) 


T1D  GWAS  Data 


Chr 

Location 

dbSNP  ID 

Dominant 

Additive 

Recessive 

Distancel 

QC  Summary 

1 

3,127,816 

rsl 2075867 

2.2E-04 

1. IE-02 

5.4E-04 

1,009,026 

1 

4,136,842 

rs4233262 

5.9E-02 

3.9E-03 

7.1E-04 

3,879 

1 

4,140,721 

rsl 2074230 

4.0E-02 

3.4E-03 

8.6E-04 

551 ,323 

1 

4,692,044 

rs3766964 

6.0E-04 

1.6E-02 

1. IE-03 

6,214,949 

1 

10,906,993 

rsl  280985 

2.2E-04 

2.0E-03 

1.4E-03 

814,822 

1 

11,721,815 

rs6540992 

3.2E-04 

2.8E-02 

2.1E-01 

4,690,349 

1 

16,412,164 

rsl 2751 725 

4.5E-10 

6.8E-07 

5.4E-08 

2,045,443 

Poor 

1 

18,457,607 

rs223185 

1.6E-27 

0.0E  +  00 

9.8E-15 

5,547,285 

Fail 

1 

24,004,892 

rs2256179 

2.8E-02 

4.0E-03 

8.4E-04 

8,762,281 

1 

32,767,173 

rsl 091 4574 

5.0E-03 

1.2E-02 

5.4E-04 

1 ,689,877 

1 

34,457,050 

rs705207 

1.9E-02 

9.8E-04 

1.0E-01 

4,107,505 

1 

38,564,555 

rs9 13863 

8.0E-04 

8.4E-04 

3.8E-05 

59,029 

Pass 

1 

38,623,584 

rs7534892 

1 .7E-02 

4.5E-04 

1.3E-04 

676 

1 

38,624,260 

rs6678679 

2.0E-02 

4.6E-04 

1.0E-04 

14,443 

1 

38,638,703 

rs4970590 

1.8E-02 

6.3E-04 

3.4E-03 

358,739 

1 

38,997,442 

rs695023 

2.2E-02 

9.8E-03 

6.4E-04 

929,779 

1 

39,927,221 

CG01 006038 

3. IE-03 

1.7E-04 

1 .3E-02 

14,231 

1 

39,941 ,452 

rsl  7561 61 6 

3.6E-04 

3.4E-03 

8.3E-02 

4,094,550 

1 

44,036,002 

rsl  2401 81 3 

7.4E-04 

1.0E-01 

8.3E-03 

10,532,245 

1 

54,568,247 

rsl  71 10430 

3.2E-09 

5.9E-05 

1.9E-05 

1,160,672 

Fail 

1 

55,728,919 

rsl 0736392 

3.0E-02 

6.7E-04 

7.7E-02 

28,602 

1 

55,757,521 

rsl 2757934 

1 .2E-02 

3.0E-04 

2.8E-02 

215 

1 

55,757,736 

rs4927247 

9.2E-03 

2.2E-04 

2.7E-02 

5,284 

1 

55,763,020 

rs754138 

6.0E-03 

3.1E-04 

2.3E-02 

2,578 

1 

55,765,598 

rsl 979827 

9.5E-03 

4.9E-04 

4.0E-02 

85 

1 

55,765,683 

rs6588561 

8.8E-03 

4.8E-04 

4.0E-02 

2,203 

1 

55,767,886 

rsl 995229 

1.  IE-02 

6.2E-04 

3.5E-02 

1 ,642,000 

1 

57,409,886 

rs6680219 

9.2E-04 

2.0E-02 

6.9E-03 

9,826 

1 

57,419,712 

rs7548633 

8.9E-04 

2.2E-02 

8.1E-03 

1,099,192 

1 

58,518,904 

rs6668411 

1.3E-08 

3.3E-07 

2.8E-06 

1,144,974 

Poor 

1 

59,663,878 

rsl 0493264 

7.8E-04 

1.  IE-02 

3.5E-02 

1,501,319 

1 

61,165,197 

rs7530011 

9.9E-04 

2.7E-03 

3.9E-03 

2,136,484 

1 

63,301,681 

rsl 71 24480 

9.9E-03 

3.8E-02 

3.1E-04 

1,390,870 

1 

64,692,551 

rs305543 

3.7E-02 

3.9E-02 

5.9E-04 

2,919,610 

1 

67,612,161 

rs3828069 

5.7E-03 

9.3E-03 

9.8E-04 

2,660,339 

1 

70,272,500 

rsl  71 31 142 

7.0E-04 

4.6E-04 

1.0E-02 

11,101,258 

1 

81,373,758 

rsl  14641 4 

1.3E-06 

2.6E-04 

1.4E-03 

486,242 

Fail 

1 

81,860,000 

rs7523717 

1.5E-01 

3.6E-03 

8.7E-04 

6,191,759 

1 

88,051,759 

rs21 17353 

2.2E-04 

3.9E-04 

1.0E-02 

2,547,223 

1 

90,598,982 

rsl  156495 

2.3E-01 

8.7E-02 

7.2E-04 

23,145 

1 

90,622,127 

rs4658158 

2.3E-01 

2.7E-02 

5.5E-04 

686,946 

1 

91,309,073 

rsl  64989 

2.1E-04 

2.9E-03 

1 .3E-02 

3,795,023 

1 

95,104,096 

rsl  2038630 

2.4E-03 

7.6E-04 

3.5E-02 

14,056,721 

1 

109,160,817 

rsl  27721 5 

2.4E-02 

1.3E-03 

5.5E-04 

2,566,055 

1 

111,726,872 

rs4839130 

1. IE-01 

1.2E-02 

4.5E-04 

2,157,583 

1 

113,884,455 

rsl 0858002 

2.2E-03 

3.4E-04 

6.2E-03 

161,245 

1 

114,045,700 

rsl 230649 

1.2E-02 

7.2E-04 

1.0E-01 

59,631 

1 

114,105,331 

rs6679677 

3.3E-05 

1.0E-05 

4.4E-02 

111,560 

Pass 

1 

114,216,891 

rs2488457 

3.8E-03 

7.1E-04 

1.2E-01 

658,412 

1 

114,875,303 

rsl 0858046 

4.6E-03 

7.2E-04 

3.0E-01 

33,689 

1 

114,908,992 

rs6689415 

6.8E-03 

2.2E-04 

2.1E-01 

120,024 

1 

115,029,016 

rs2268697 

3.4E-03 

7.6E-05 

8.3E-02 

45,079,815 

Pass 

1 

160,108,831 

rsl  091 81 04 

9.1E-04 

3.0E-03 

1.7E-03 

1,085,715 

1 

161,194,546 

rsl 091 7653 

3.1E-02 

7.3E-04 

3.5E-03 

165,460 

1 

161,360,006 

rs2841989 

8.5E-04 

9.8E-03 

3.9E-03 

3,978,593 

1 

165,338,599 

rs9287077 

1.0E-03 

8.5E-04 

1 .5E-02 

441 ,748 

1 

165,780,347 

rsl  2042693 

1.8E-02 

8.5E-04 

6.6E-02 

474 

1 

165,780,821 

rsl 2022039 

1 .7E-02 

8.5E-04 

3.2E-02 

107,673 

1 

165,888,494 

rs7527985 

5.8E-02 

6.0E-03 

2.2E-04 

16,361,287 

1 

182,249,781 

rs4651164 

6. IE-04 

4.6E-03 

3.6E-02 

8,528,025 

1 

190,777,806 

rsl 0489868 

1.  IE-02 

3.2E-04 

1.9E-01 

3,719,816 

1 

194,497,622 

rsl 6839682 

6.3E-03 

1. IE-04 

3.7E-03 

5,190,083 

1 

199,687,705 

rs7553988 

2.2E-01 

9.7E-04 

8.3E-03 

1,119 

1 

199,688,824 

rsl 0920205 

2.9E-01 

3.4E-04 

2.3E-02 

1,739 

1 

199,690,563 

rs3887566 

3.8E-01 

5.6E-04 

2.9E-02 

61,045 

1 

199,751,608 

rs451 8935 

1.8E-08 

2.3E-06 

2.6E-05 

1,144,760 

Fail 

1 

200,896,368 

rs7517181 

6.1E-04 

5.4E-02 

6.7E-02 

4,299,521 

1 

205,195,889 

rsl 0863962 

3.2E-01 

7.7E-02 

9.1E-04 

1,036,151 

1 

206,232,040 

rs684431 

6.7E-02 

9.7E-04 

9.2E-03 

2,742,046 

1 

208,974,086 

rsl 890845 

8.3E-04 

1.2E-02 

8.0E-02 

7,648,834 

1 

216,622,920 

rsl 0482751 

8.6E-03 

6.2E-04 

1.3E-03 

1,312 

1 

216,624,232 

rs4335431 

1.7E-03 

6.1E-04 

1 .5E-02 

1,521,013 

1 

218,145,245 

rs2378330 

1.3E-03 

6.7E-04 

8.0E-02 

39,228 

1 

218,184,473 

rs2647442 

6.6E-04 

1.2E-03 

5.8E-02 

261,653 

1 

218,446,126 

rs41 42940 

9.3E-04 

6. IE-03 

5.5E-02 

4,502,325 

1 

222,948,451 

rsl  4361 71 

8.7E-04 

2.7E-03 

1.4E-01 

6,127 

1 

222,954,578 

rsl  5611 10 

7.1E-04 

4.7E-03 

1.6E-01 

2,565 

T1  DBase  Candidate  Genes 

Locus  Location  1  Location2 


Criteria  for  Candidate  SNP  Confirmed/Suggested 

Association  Population  Strong  p-values  3  SNP  within  50kb  T 1 D  Loci  All  Loci 


ARHGEF19 

ARHGEF19 

IGSF21 

IGSF21 

Intergenic 

Intergenic 

676 

Intergenic 

14,443 

Intergenic 

Intergenic 

SSBP3 

28,602 

215 

5,284 

2,578 

85 

2,203 


DAB1  57,236,167  58,488,799 

DAB1  57,236,167  58,488,799 

DAB1  57,236,167  58,488,799 


DAB1 


SSBP3 

Intergenic 

Intergenic 

Intergenic 

Intergenic 

Intergenic 

Intergenic 

Intergenic 

DAB1 

DAB1 

DAB1 


IL12RB2  67,545,635  67,635,171  Weak  Japanese 

LRRC7  69,998,446  70,360,514 

LOC646538 


IL12RB2 

LRRC7 

LOC646538 


PHTF1 

PHTF1 

PTPN22 


114,041,360  114,102,879  PHTF1 

114,041,360  114,102,879  PHTF1  PHTF1 

114,157,960  114,215,857  Mixed  Finnish,  German,  Dutch,  Canadian  Caucasian,  Colombian  PTPN22 


PHTF1 

PHTF1 

PTPN22 


AMPD1 


AMPD1 


1,119  PHLDA3 

1,739  PHLDA3 

PHLDA3 

PHLDA3  PHLDA3 


6,127 

2,565 


CNIH3 

CNIH3 


1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 


222,957,143 

rsl 965776 

6.5E-04 

3.4E-03 

1.9E-01 

1,681,099 

CNIH3 

224,638,242 

rs211 7114 

1.5E-01 

7.5E-02 

5.0E-04 

6,273,622 

PARP1  224,615,129 

224,662,414  Weak 

French,  French  Caucasian 

PARP1 

230,911,864 

rs7531764 

6.8E-05 

2.7E-03 

8.6E-04 

369,006 

Pass 

Intergenic 

Intergenic 

231 ,280,870 

rs6684434 

6.9E-04 

4.1E-03 

5.7E-04 

5,728,342 

237,009,212 

rs2689125 

1.4E-03 

4.6E-03 

7.8E-04 

1 ,668,253 

238,677,465 

rsl  241 0581 

4.2E-04 

9.9E-04 

7.3E-02 

392,560 

239,070,025 

rsl  21 16473 

6.5E-04 

2.0E-01 

1.9E-01 

326,992 

239,397,017 

rs4660036 

3.3E-02 

1.1E-02 

5.5E-04 

681,991 

240,079,008 

rs41 49852 

2.4E-02 

7.4E-04 

2.1E-02 

3,918,864 

243,997,872 

rsl  31 6384 

7.3E-04 

2.0E-02 

9.9E-02 

34,302 

244,032,174 

rs2791398 

5.0E-04 

9.8E-02 

9.5E-02 

627,597 

rsl 3388043 

3.7E-03 

1 .2E-02 

9.8E-04 

1 ,831 ,232 

2,458,829 

rsl 11 27353 

4.0E-03 

7.3E-04 

8.6E-02 

276,395 

2,735,224 

rs4853775 

5.9E-07 

8.5E-06 

3.2E-04 

256,099 

Fail 

Intergenic 

Intergenic 

2,991 ,323 

rsl 7040704 

6.7E-04 

3.8E-02 

1 .7E-02 

460,042 

3,451,365 

rsl 2994580 

2.1E-02 

8.8E-04 

3.0E-02 

4,652,034 

8,103,399 

rs6732658 

6.0E-02 

6.3E-04 

6.2E-02 

3,344,421 

11,447,820 

rs4669724 

4.7E-03 

2.4E-02 

9.1E-04 

262,488 

11,710,308 

rsl  1690767 

1. IE-01 

4.0E-02 

8.9E-04 

1,373,512 

13,083,820 

rs953006 

2.8E-04 

6.6E-03 

1 .0E-02 

95,134 

13,178,954 

rsl  02081 14 

5.3E-04 

2. IE-03 

4.4E-03 

9,398 

13,188,352 

rs2 128503 

4.3E-04 

1.5E-03 

3.3E-03 

52,320 

13,240,672 

rs67 19067 

4.0E-04 

1.9E-03 

4.7E-03 

302,149 

13,542,821 

rs480881 

2.2E-05 

3.3E-05 

7.7E-03 

744 

Pass 

Intergenic 

744 

Intergeneic 

13,543,565 

rs511773 

4.8E-04 

9.8E-04 

2.6E-02 

5,247 

5,247 

Intergeneic 

13,548,812 

rsl  1693523 

6. IE-05 

1.3E-04 

4.4E-03 

20,648 

Pass 

Intergenic 

20,648 

Intergeneic 

13,569,460 

rsl 0929862 

7.8E-05 

2.2E-04 

5.1E-03 

86 

Pass 

Intergenic 

86 

Intergeneic 

13,569,546 

rsl  1686208 

7.9E-05 

2.3E-04 

6.8E-03 

656,868 

Pass 

Intergenic 

Intergeneic 

14,226,414 

rsl 448965 

5.6E-03 

6.0E-04 

3.0E-02 

4,197 

4,197 

Intergeneic 

14,230,611 

rsl 0427320 

1 .4E-02 

3.6E-04 

6.2E-02 

3,887 

3,887 

Intergeneic 

14,234,498 

rsl  448972 

5. IE-03 

4.4E-04 

2.5E-02 

1,760,718 

Intergeneic 

15,995,216 

rsl 0929400 

8.0E-03 

4.9E-04 

1.6E-02 

1,546,313 

17,541,529 

rs368430 

9.0E-02 

6.7E-02 

5.2E-05 

34,648 

Pass 

VSNL1 

34,648 

VSNL1 

17,576,177 

rs446947 

1.0E-01 

7.7E-02 

1.7E-04 

25,503 

25,503 

VSNL1 

17,601,680 

rs426590 

1.2E-01 

9.4E-02 

1.7E-04 

5,407,687 

VSNL1 

23,009,367 

rs7586848 

4.3E-04 

3.6E-03 

1.1E-02 

6,792,835 

29,802,202 

rs2257135 

4.0E-03 

1 .2E-02 

2.6E-04 

5,966 

29,808,168 

rs2631956 

4.1E-03 

4.1E-02 

6.1E-04 

79,615 

29,887,783 

rsl  1694445 

1.2E-04 

6.3E-04 

3.2E-04 

7,938 

29,895,721 

rsl  01 87657 

4.3E-03 

5.5E-04 

1.2E-03 

98,323 

29,994,044 

rs4233753 

3.8E-03 

3.9E-03 

2.2E-04 

1,152,277 

31,146,321 

rsl  701 0651 

1.3E-10 

2. IE-04 

2.0E-04 

122,260 

Fail 

GALNT14 

GALNT14 

31,268,581 

rsl  01 92210 

6.6E-04 

8.7E-04 

4.0E-03 

5,007,239 

36,275,820 

rsl  167437 

1.5E-04 

9.9E-05 

8.0E-03 

5,351,829 

Pass 

Intergenic 

Intergenic 

41,627,649 

rsl 11 24822 

6.6E-03 

9.3E-03 

1.0E-03 

1,628,152 

43,255,801 

rs4952974 

7.0E-04 

1.4E-03 

2.7E-03 

1 ,201 ,769 

44,457,570 

rs786626 

1.0E-03 

1 .9E-02 

8.6E-04 

50,839 

44,508,409 

rsl 067367 

7.1E-04 

9.1E-03 

1.2E-03 

5,711 

5,711 

C2orf34 

44,514,120 

rsl 065786 

7.1E-04 

8.9E-03 

1.7E-03 

11,549 

11,549 

C2orf34 

44,525,669 

rsl 067347 

3.7E-04 

6.0E-03 

7.4E-04 

67,707 

C2orf34 

44,593,376 

rsl  1888605 

9.5E-04 

1 .0E-02 

4.6E-03 

6,391 

44,599,767 

rs3862996 

4.8E-04 

6.2E-03 

1. IE-03 

1,104,546 

45,704,313 

rs7565326 

1.5E-03 

7.4E-04 

4.0E-03 

557,531 

46,261 ,844 

rs2594491 

8.7E-03 

2.0E-03 

5.4E-04 

5,892,860 

52,154,704 

rsl 806739 

1 .7E-02 

2.8E-01 

7.6E-04 

4,776,793 

56,931 ,497 

rs6545601 

1 .4E-02 

2.8E-02 

6.8E-04 

26,666 

26,666 

LOC647016 

56,958,163 

rs2870503 

1 .4E-02 

3.4E-02 

6.8E-04 

15,057 

15,057 

LOC647016 

56,973,220 

rs4672171 

4.0E-03 

8.2E-03 

3.1E-04 

34,382 

34,382 

LOC647016 

57,007,602 

rs2865254 

5.9E-04 

1.4E-03 

2.1E-04 

605,017 

LOC647016 

57,612,619 

rs820985 

7.7E-05 

8.8E-04 

5.7E-03 

41 ,490 

Pass 

Intergenic 

41 ,490 

Intergenic 

57,654,109 

rsl  11 2571 8 

4.0E-04 

2.5E-03 

2.4E-02 

4,400,872 

Intergenic 

62,054,981 

rsl  1885927 

7.3E-03 

3.7E-04 

1.9E-03 

63,080 

62,118,061 

rsl  261 41 02 

7.1E-03 

3.4E-04 

8.2E-03 

2,980,808 

65,098,869 

rs759458 

1.9E-01 

2.4E-04 

3.1E-02 

1 ,928,536 

67,027,405 

rsl 7032787 

6.8E-04 

4.8E-03 

3.1E-04 

3,886,214 

70,913,619 

rsl 7006436 

4.4E-04 

2.6E-02 

5.1E-02 

2,386,708 

73,300,327 

rs7605115 

1.8E-04 

2.3E-04 

1.1E-04 

3,674,397 

76,974,724 

rsl 701 3302 

4.3E-04 

8.9E-03 

1.7E-01 

222 

76,974,946 

rsl  701 3306 

5.5E-04 

8.8E-03 

2.5E-01 

6,214,322 

83,189,268 

rsl 0496291 

4.6E-04 

2.8E-03 

2.3E-02 

3,149,558 

86,338,826 

rsl  751 051 2 

5.8E-04 

2.8E-03 

8.1E-03 

536,628 

86,875,454 

rsl 302321 3 

4.8E-03 

9.5E-04 

3.7E-02 

15,833,416 

102,708,870 

rsl 7027825 

1.3E-02 

7.7E-04 

1 .7E-02 

32,618 

32,618 

TMEM182 

102,741,488 

rs2732838 

5.6E-03 

4.7E-05 

2.7E-03 

6,761 

Pass 

TMEM182 

6,761 

TMEM182 

102,748,249 

rs2732841 

3.9E-03 

4.5E-05 

1.4E-03 

16,517 

Pass 

TMEM182 

16,517 

TMEM182 

102,764,766 

rs2732860 

1.2E-03 

1.1E-04 

1. IE-03 

16,497 

16,497 

TMEM182 

102,781,263 

rs21 10665 

8.4E-04 

5.9E-05 

1.7E-03 

1 ,277,253 

Pass 

TMEM182 

TMEM182 

104,058,516 

rsl 11 23977 

1 .0E-02 

4.5E-03 

1.5E-05 

21,878 

Pass 

LOC728838 

21,878 

LOC728838 

104,080,394 

rs2376057 

3.6E-02 

2.7E-02 

1.7E-04 

169 

169 

LOC728838 

104,080,563 

rs2889276 

6.1E-02 

8.5E-02 

5.2E-04 

728,917 

LOC728838 

104,809,480 

rs4851722 

5.0E-03 

5.3E-04 

1 .4E-02 

643,896 

2 

105,453,376 

rs6725019 

9.6E-04 

2.6E-03 

1 .9E-02 

557,828 

2 

106,011,204 

rsl 70321 92 

6.9E-03 

1.2E-04 

1 .3E-02 

12,076,316 

2 

118,087,520 

rs2551684 

3.6E-04 

3.2E-03 

1.6E-03 

27,452 

2 

118,114,972 

rs2255565 

3.6E-04 

3.4E-03 

1.8E-03 

1 ,484,829 

2 

119,599,801 

rs7570552 

6.0E-04 

4.6E-03 

3.8E-03 

370,016 

2 

119,969,817 

rs920338 

1.  IE-02 

7.9E-04 

1.9E-01 

1,690,010 

2 

121,659,827 

rsl 2467309 

7.4E-02 

1 .0E-02 

6.9E-04 

1 ,044,232 

2 

122,704,059 

rsl  432311 

3.6E-03 

4.4E-04 

3.7E-03 

9,943 

2 

122,714,002 

rs934390 

6.4E-04 

5.3E-04 

8.6E-04 

95,396 

2 

122,809,398 

rsl  881 893 

1 .5E-02 

4.4E-04 

2.9E-02 

25,853 

2 

122,835,251 

rsl 0496582 

1 .4E-02 

3.5E-04 

1 .6E-02 

132 

2 

122,835,383 

rsl 0496583 

4.2E-02 

3.9E-04 

3.3E-02 

425,221 

2 

123,260,604 

rs6754547 

2.2E-03 

4.2E-03 

1.3E-04 

16,562 

2 

123,277,166 

rs294665 

7.7E-03 

1 .3E-02 

7.7E-05 

15,297 

2 

123,292,463 

rs296042 

1.8E-03 

2.6E-02 

8.8E-05 

1,263 

2 

123,293,726 

rs296047 

2.4E-03 

3.0E-02 

1.5E-04 

9,607 

2 

123,303,333 

rsl 2469942 

2.3E-03 

2.8E-02 

1.4E-04 

12,610 

2 

123,315,943 

rs296029 

5.6E-03 

2.6E-02 

9.1E-04 

3,327 

2 

123,319,270 

rs296034 

1.2E-03 

2.2E-02 

5.3E-05 

12,113 

2 

123,331,383 

rs294643 

1.4E-03 

1 .9E-02 

4.5E-05 

73,232 

2 

123,404,615 

rs2139011 

1.  IE-02 

2.4E-02 

2.3E-04 

1,610,215 

2 

125,014,830 

rsl  701 1610 

1.5E-04 

8.8E-03 

9.2E-03 

19,863 

2 

125,034,693 

rs6760693 

9.0E-04 

5.2E-02 

1.0E-01 

2,701 ,232 

2 

127,735,925 

rs4150512 

1.8E-04 

3.2E-02 

2.8E-02 

5,433,000 

2 

133,168,925 

rs9287423 

2. IE-04 

4.4E-03 

1.7E-01 

7,469,645 

2 

140,638,570 

rsl 2328474 

1.0E-04 

1 .9E-02 

3.7E-03 

1,714,360 

2 

142,352,930 

rsl 0496905 

2.3E-03 

1.8E-03 

8.4E-04 

1 ,253,289 

2 

143,606,219 

rs354714 

1 .7E-02 

6.1E-04 

2.2E-03 

400,937 

2 

144,007,156 

rs4525633 

1 .3E-02 

6.3E-04 

5.0E-03 

6,301 ,799 

2 

150,308,955 

rsl 0490076 

1.5E-01 

8.4E-04 

6.8E-02 

3,280 

2 

150,312,235 

rsl 774521 2 

1 .4E-01 

7.1E-04 

6.3E-02 

1,149,914 

2 

151,462,149 

rsl 01 88669 

5.4E-02 

3.8E-03 

7.2E-05 

2,978,234 

2 

154,440,383 

rsl  1893783 

9.2E-04 

9.2E-03 

9.7E-02 

2,434,635 

2 

156,875,018 

rsl 684021 4 

2.3E-02 

4.7E-04 

1  .IE-01 

2,825,927 

2 

159,700,945 

rs2288105 

1.9E-04 

2.9E-02 

3.8E-02 

3,720,901 

2 

163,421,846 

rsl  001 911 

8.4E-03 

3.5E-04 

6.4E-03 

277 

2 

163,422,123 

rsl  492990 

7.0E-03 

2.4E-04 

7.4E-03 

20,253 

2 

163,442,376 

rsl 0203602 

1. IE-03 

2.4E-04 

3.7E-03 

7,596 

2 

163,449,972 

rsl 364562 

1.5E-03 

3.0E-04 

5.3E-03 

19,865 

2 

163,469,837 

rs307869 

4.3E-03 

6.4E-04 

5.7E-03 

9,912,552 

2 

173,382,389 

rs7584826 

2.7E-02 

4.9E-04 

4.3E-03 

5,126,972 

2 

178,509,361 

rsl  338641 8 

1.8E-04 

2.8E-03 

4.5E-03 

2,130,345 

2 

180,639,706 

rs262302 

1.3E-04 

6.3E-04 

3.4E-03 

8,682,875 

2 

189,322,581 

rsl  400945 

1 .0E-02 

9.7E-04 

9.9E-03 

1,691,488 

2 

191,014,069 

rs7581762 

1.3E-01 

1.3E-01 

5.6E-04 

213,036 

2 

191,227,105 

rsl 860849 

2.7E-02 

1.3E-03 

7.0E-04 

1,533,493 

2 

192,760,598 

rsl  11 3932 

9.7E-02 

9.6E-04 

3.9E-02 

8,250,657 

2 

201,011,255 

rs2028201 

4.3E-03 

8.7E-04 

3.2E-03 

3,417,129 

2 

204,428,384 

rsl  1571 292 

2.5E-02 

4.0E-03 

2.6E-04 

20,727 

2 

204,449,111 

rs231726 

7.3E-03 

3.4E-04 

5.2E-04 

300 

2 

204,449,411 

rsl  427676 

6.5E-03 

4.5E-04 

6.0E-04 

448,253 

2 

204,897,664 

rs4673296 

2.5E-03 

5.2E-04 

2.2E-03 

7,975,373 

2 

212,873,037 

rsl  741 881 4 

6.9E-04 

5.7E-04 

5.3E-04 

8,380 

2 

212,881,417 

rsl 684831 2 

1.3E-03 

8.3E-04 

1.4E-03 

2,458,130 

2 

215,339,547 

rs7599060 

4.9E-03 

4.8E-04 

1 .3E-02 

855,437 

2 

216,194,984 

rs6752170 

6.2E-03 

3.2E-04 

5.4E-03 

352,192 

2 

216,547,176 

rsl  1692400 

5.9E-04 

4.5E-03 

1 .4E-02 

889,870 

2 

217,437,046 

rsl 468860 

1 .4E-02 

6.1E-02 

3.7E-04 

965,495 

2 

218,402,541 

rs6729330 

3.9E-04 

1 .9E-02 

1.6E-01 

1,940,212 

2 

220,342,753 

rsl 035680 

1.5E-13 

1.2E-10 

1.5E-05 

1 ,889,933 

2 

222,232,686 

rsl 3030635 

1.0E-01 

3.2E-02 

7.8E-04 

1,704,316 

2 

223,937,002 

rsl  25091 5 

1 .7E-02 

6.2E-04 

8.4E-03 

3,638 

2 

223,940,640 

rs990297 

2.2E-02 

7.6E-04 

9.1E-03 

726,918 

2 

224,667,558 

rsl 01 97404 

3.0E-02 

5.7E-04 

1 .7E-02 

175 

2 

224,667,733 

rs75741 30 

2.9E-02 

4.7E-04 

1 .7E-02 

3,545,014 

2 

228,212,747 

rsl 01 79572 

4.5E-02 

7.8E-03 

4.8E-04 

1 ,701 ,272 

2 

229,914,019 

rs6757593 

1.5E-04 

1.2E-03 

7.4E-03 

3,262 

2 

229,917,281 

rs7563766 

9.7E-04 

3.4E-03 

1 .4E-02 

1,484 

2 

229,918,765 

rs67 17477 

7.5E-04 

3.3E-03 

5.2E-03 

4,430 

2 

229,923,195 

rs9288646 

2.3E-04 

1.4E-03 

4.3E-03 

1,602 

2 

229,924,797 

rs6748446 

2.7E-04 

1.6E-03 

1 .2E-02 

7,478 

2 

229,932,275 

rs7594321 

3.7E-04 

2.9E-03 

9.5E-03 

438 

2 

229,932,713 

rs6436860 

6.8E-04 

5.0E-03 

1 .7E-02 

32,117 

2 

229,964,830 

rs207671 

1.2E-04 

5.6E-04 

3.9E-03 

83,812 

2 

230,048,642 

rs671 5897 

3.2E-03 

4.0E-04 

2.5E-02 

734 

2 

230,049,376 

rs6750269 

2. IE-03 

2.2E-04 

1 .7E-02 

11,586 

2 

230,060,962 

rs7570424 

1.8E-03 

2.0E-04 

1 .7E-02 

112 

2 

230,061 ,074 

rs7609541 

2. IE-03 

2.5E-04 

1 .4E-02 

21 

2 

230,061 ,095 

rs7556846 

1.8E-03 

1.9E-04 

1 .7E-02 

125 

2 

230,061 ,220 

rs7570739 

1.3E-03 

1.3E-04 

1 .2E-02 

5,229,679 

2 

235,290,899 

rs4663163 

3.5E-03 

2.4E-04 

8.5E-02 

156 

Pass 

Pass 


Pass 

Pass 


Pass 


poor 


CTLA4 

CTLA4 

CTLA4 


BARD1 


25,853 

Intergenic 

132 

Intergenic 

Intergenic 

16,562 

Intergenic 

Intergenic 

15,297 

Intergenic 

Intergenic 

1,263 

Intergenic 

9,607 

Intergenic 

12,610 

Intergenic 

3,327 

Intergenic 

Intergenic 

12,113 

Intergenic 

Intergenic 

Intergenic 

Intergenic 


Intergenic 


277 

20,253 

7,596 

19,865 


KCNH7 

KCNH7 

KCNH7 

KCNH7 

KCNH7 


204,440,754 

204,446,928  Mixed 

South  Moroccan, 

UK, 

Russian, 

Filipit  20,727 

CTLA4 

CTLA4 

204,440,754 

204,446,928  Mixed 

South  Moroccan, 

UK, 

Russian, 

Filipir  300 

CTLA4 

CTLA4 

204,440,754 

204,446,928  Mixed 

South  Moroccan, 

UK, 

Russian, 

Filipinos,  Alsacian, 

DAISY,  Jc  CTLA4 

CTLA4 

215,301,522 

215,382,611 

BARD1 

Intergenic  Intergenic 


3,262 

I, 484 
4,430 
1,602 
7,478 
438 
32,117 

734 

II, 586 
112 

21 

125 


DNER 

DNER 

DNER 

DNER 

DNER 

DNER 

DNER 

DNER 

DNER 

DNER 

DNER 

DNER 

DNER 

DNER 


2 

235,291,055 

rs4663165 

1.8E-03 

1.5E-04 

6.9E-02 

2,401 ,323 

2 

237,692,378 

rs7570534 

1.3E-03 

1.4E-04 

3.5E-03 

26,728 

2 

237,719,106 

rs6761680 

2.8E-03 

5.4E-04 

1 .3E-02 

514 

2 

237,719,620 

rs7585219 

7.2E-04 

3.5E-04 

3.3E-03 

1,818 

2 

237,721 ,438 

rsl 0201 392 

6.3E-04 

3.8E-04 

3.3E-03 

8,764 

2 

237,730,202 

rsl  01 72321 

1.5E-03 

5.3E-04 

3.9E-03 

1,197,798 

2 

238,928,000 

rs821 804 

1.7E-05 

7.1E-05 

4.4E-05 

135,548 

Pass 

2 

239,063,548 

rs523437 

9.5E-05 

1.7E-03 

3.0E-04 

1,621 

Pass 

2 

239,065,169 

rs580209 

1.3E-04 

2.0E-03 

3.7E-04 

1 ,490,800 

2 

240,555,969 

rs4854067 

1.6E-03 

2.5E-02 

1.2E-04 

28,161 

2 

240,584,130 

rsl 23281 43 

5.6E-04 

2.5E-03 

6.6E-04 

3,078 

2 

240,587,208 

rsl  1685586 

1.9E-03 

3.0E-02 

2.4E-04 

3 

3,030,247 

rsl 7024684 

1 .5E-02 

2.9E-04 

1 .0E-02 

5,666 

3 

3,035,913 

rsl 420020 

3.1E-02 

7.5E-04 

1 .8E-02 

1,843,015 

3 

4,878,928 

rs981 9062 

9.5E-04 

9.6E-03 

1 .2E-02 

469 

3 

4,879,397 

rsl 2638937 

1.0E-03 

1 .0E-02 

8.7E-03 

1,659 

3 

4,881,056 

rsl  7041 61 4 

4.8E-04 

5.6E-03 

8.7E-03 

1,188,317 

3 

6,069,373 

rsl  6071 93 

1.3E-03 

3.1E-02 

1.7E-04 

6,811,093 

3 

12,880,466 

rsl 11 28620 

1.0E-03 

2.4E-02 

1.0E-01 

1,715,346 

3 

14,595,812 

rs9821893 

2.8E-02 

5.6E-04 

2.8E-03 

3,715 

3 

14,599,527 

rsl  171 2736 

6.8E-02 

7.7E-04 

4.7E-03 

724,602 

3 

15,324,129 

rsl 051 0436 

4.1E-03 

6.6E-04 

5.3E-04 

820,585 

3 

16,144,714 

rsl 7272271 

5. IE-03 

3.3E-04 

5.2E-03 

23,807 

3 

16,168,521 

rs2657589 

4.8E-03 

4.3E-04 

9.9E-03 

3,141,178 

3 

19,309,699 

rs727139 

3.6E-05 

9.5E-05 

3.2E-04 

8,579,831 

poor 

3 

27,889,530 

rs9849464 

2.1E-04 

1.3E-03 

2.1E-02 

39,511 

3 

27,929,041 

rsl 7020606 

5.9E-04 

9.4E-04 

2.2E-02 

1,164,299 

3 

29,093,340 

rs541374 

5.4E-02 

7.6E-04 

1 .7E-02 

15,342,494 

3 

44,435,834 

rs9865920 

2.6E-03 

1.6E-04 

1 .5E-02 

25,084 

3 

44,460,918 

rs461 0267 

3.9E-03 

2.1E-04 

3.9E-02 

11,687 

3 

44,472,605 

rs9311357 

4.8E-03 

2.9E-04 

3.8E-02 

101,502 

3 

44,574,107 

rs2293600 

7.3E-03 

4.5E-04 

1.0E-01 

11,355 

3 

44,585,462 

rs9867336 

1 .6E-02 

4.8E-04 

8.8E-02 

60,759 

3 

44,646,221 

rs6769283 

9.7E-03 

6.4E-04 

8.3E-02 

13 

3 

44,646,234 

rs6769288 

9.8E-03 

5.3E-04 

9.4E-02 

3,434 

3 

44,649,668 

rsl  171 0959 

1.  IE-02 

6.8E-04 

9.4E-02 

69,329 

3 

44,718,997 

rs6778410 

4.2E-03 

5.5E-04 

6.6E-02 

512,198 

3 

45,231,195 

rs28294 

7.4E-02 

5.7E-03 

9.8E-04 

277,033 

3 

45,508,228 

rs21 28361 

1.5E-07 

2.8E-05 

6.3E-04 

15,021,333 

poor 

3 

60,529,561 

rs2594259 

8.6E-04 

1.9E-03 

5.5E-03 

234,267 

3 

60,763,828 

rsl  447926 

3.3E-03 

3.1E-04 

4.0E-03 

1,105,912 

3 

61 ,869,740 

rsl  191 9255 

5.1E-02 

3.0E-04 

9.8E-03 

4,532,012 

3 

66,401 ,752 

rsl 3088067 

1.6E-03 

4.1E-03 

4.5E-04 

400 

3 

66,402,152 

rsl 309761 9 

1. IE-03 

3.0E-03 

6.6E-04 

7,548,983 

3 

73,951,135 

rs6784023 

4.8E-04 

8.1E-03 

4.3E-03 

3,454,203 

3 

77,405,338 

rs6784671 

6.9E-03 

8.8E-04 

1.3E-01 

4,962,595 

3 

82,367,933 

rs43641 88 

9. IE-05 

8.1E-03 

2.5E-02 

38,068 

Pass 

3 

82,406,001 

rs7645576 

1. IE-04 

1.1E-02 

2.4E-02 

13,783 

3 

82,419,784 

rs9309889 

3.8E-04 

2.5E-02 

3.9E-02 

4,619 

3 

82,424,403 

rs6778876 

8.6E-04 

3.1E-02 

4.3E-02 

9,792 

3 

82,434,195 

rs9878507 

3.2E-05 

2.7E-03 

1 .5E-02 

7,018 

Pass 

3 

82,441,213 

rs2 122356 

1.8E-04 

1 .3E-02 

2.8E-02 

512 

3 

82,441 ,725 

rsl  449862 

7.9E-05 

8.4E-03 

1 .5E-02 

10,421 

Pass 

3 

82,452,146 

rs9880194 

1.2E-04 

8.8E-03 

2.0E-02 

19,230 

3 

82,471 ,376 

rs6789603 

1.2E-04 

1 .6E-02 

1 .5E-02 

14,106 

3 

82,485,482 

rsl  191 6850 

1.8E-04 

1 .0E-02 

1 .7E-02 

10,694 

3 

82,496,176 

rsl 449865 

4.8E-05 

4.2E-03 

1 .2E-02 

15,664 

Pass 

3 

82,511,840 

rs935725 

3.8E-05 

5.7E-03 

1 .8E-02 

2,709 

Pass 

3 

82,514,549 

rs9827304 

8.7E-05 

8.0E-03 

3.2E-02 

6,275 

Pass 

3 

82,520,824 

rs9868998 

3.3E-05 

4.7E-03 

1 .5E-02 

1,292 

Pass 

3 

82,522,116 

rsl 2497206 

1.7E-04 

1.1E-02 

2.0E-02 

833 

3 

82,522,949 

rs2044752 

3.6E-04 

2.5E-02 

3.3E-02 

1,490 

3 

82,524,439 

rs9309900 

4. IE-05 

2.9E-03 

9.6E-03 

4,764 

Pass 

3 

82,529,203 

rs6783449 

9. IE-05 

3.5E-03 

1 .0E-02 

233 

Pass 

3 

82,529,436 

rs2 122355 

1.7E-05 

1.8E-03 

7.1E-03 

3,104,657 

Pass 

3 

85,634,093 

rs726610 

4.1E-02 

8.2E-04 

2.8E-02 

39,705 

3 

85,673,798 

rsl  751 71 21 

1. IE-01 

9.7E-04 

1 .9E-02 

12,535 

3 

85,686,333 

rs7652808 

1 .6E-02 

2.6E-04 

1 .2E-02 

6,844 

3 

85,693,177 

rsl 530740 

8.1E-02 

6.6E-04 

3.0E-02 

12,176 

3 

85,705,353 

rsl 2494446 

1.2E-01 

9.9E-04 

3.0E-02 

3,273,585 

3 

88,978,938 

rsl 2634593 

9.6E-04 

1 .4E-01 

1 .6E-02 

33,570 

3 

89,012,508 

rs6764404 

8.9E-04 

1.2E-01 

2.4E-02 

97,838 

3 

89,110,346 

rs9829987 

9. IE-04 

2.9E-02 

9.1E-03 

13,353,193 

3 

102,463,539 

CG03018150 

9.0E-04 

9.1E-04 

6.8E-02 

15,821 

3 

102,479,360 

rs7634592 

9.4E-04 

1.7E-03 

1  .IE-01 

24,977 

3 

102,504,337 

rsl 6843699 

8.5E-04 

7.5E-04 

6.8E-02 

428,086 

3 

102,932,423 

rs2611836 

1.9E-03 

9.5E-04 

7.7E-02 

3,092,789 

3 

106,025,212 

rs6437543 

2.8E-03 

1.5E-04 

2.0E-02 

129,729 

3 

106,154,941 

rsl 369905 

8.0E-03 

2.6E-04 

4.3E-03 

8,224,122 

3 

114,379,063 

rsl 2495201 

7.7E-04 

2.8E-02 

1.8E-01 

17,764,610 

3 

132,143,673 

rs2296024 

8.4E-04 

3.2E-02 

2.0E-01 

9,532,419 

CNTN4 

CNTN4 


SENP7 

SENP7 


2,117,247 

2,117,247 


102,525,812 

102,525,812 


26,728 

COPS8 

514 

COPS8 

1,818 

COPS8 

8,764 

COPS8 

COPS8 

TRAF3IP1 

TRAF3IP1 

TRAF3IP1 

1,621 

TRAF3IP1 

TRAF3IP1 

28,161 

NDUFA10 

3,078 

NDUFA10 

NDUFA10 

CNTN4 

CNTN4 

469 

ITPR1 

1,659 

ITPR1 

ITPR1 

KCNH8 

KCNH8 

25,084 

ZNF445 

11,687 

ZNF445 

ZNF445 

13 

ZNF197 

3,434 

ZNF197 

ZNF197 

LARS2 

LARS2 

Intergenic 

38,068 

Intergeneic 

13,783 

Intergeneic 

4,619 

Intergeneic 

9,792 

Intergeneic 

Intergenic 

7,018 

Intergeneic 

512 

Intergeneic 

Intergenic 

10,421 

Intergeneic 

19,230 

Intergeneic 

14,106 

Intergeneic 

10,694 

Intergeneic 

Intergenic 

15,664 

Intergeneic 

Intergenic 

2,709 

Intergeneic 

Intergenic 

6,275 

Intergeneic 

Intergenic 

1,292 

Intergeneic 

833 

Intergeneic 

1,490 

Intergeneic 

Intergenic 

4,764 

Intergeneic 

Intergenic 

233 

Intergeneic 

Intergenic 

Intergeneic 

39,705 

Intergeneic 

12,535 

Intergeneic 

6,844 

Intergeneic 

12,176 

Intergeneic 

Intergeneic 

15,821 

102,714,776  24,977  SENP7 

102,714,776  SENP7 


3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

3 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 

4 


141,676,092 

rsl  051 3115 

5.9E-04 

1.6E-01 

3.1E-01 

5,231 ,644 

146,907,736 

rs2459136 

8.7E-05 

5.4E-03 

1.5E-03 

18,363 

Pass 

GM2AP 

18,363 

GM2AP 

146,926,099 

rs2687861 

2.1E-04 

1 .2E-02 

6.6E-03 

4,918,331 

GM2AP 

151,844,430 

rs6440690 

8.6E-05 

3.6E-02 

4.4E-02 

785,991 

Pass 

SELT 

SELT 

152,630,421 

rs6771904 

1 .8E-02 

5.3E-04 

2.7E-03 

1,137,684 

MED12L  152,287,366 

152,634,500 

MED12L 

153,768,105 

rsl  1925597 

4.9E-02 

5.5E-04 

1.6E-01 

4,576,339 

158,344,444 

rs6809003 

2.7E-03 

9.1E-05 

7.1E-03 

88,982 

Pass 

CCNL1 

CCNL1 

158,433,426 

rsl 3081 038 

4.6E-03 

1.9E-04 

1 .2E-02 

1,462 

1,462 

CCNL1 

158,434,888 

rsl  0451 91 6 

3. IE-03 

1.5E-04 

1.1E-02 

551 

551 

CCNL1 

158,435,439 

rsl 392799 

1.2E-03 

2.1E-05 

4.3E-03 

10,853 

Pass 

CCNL1 

10,853 

CCNL1 

158,446,292 

rs9831830 

3.0E-03 

1.6E-04 

1.1E-02 

2,637 

2,637 

VEPH1 

158,448,929 

rs7642179 

3.0E-03 

1.0E-04 

6.3E-03 

7,569 

7,569 

VEPH1 

158,456,498 

rsl 500925 

3.4E-03 

1.8E-04 

1.1E-02 

627 

627 

VEPH1 

158,457,125 

rs9289980 

1 .5E-02 

7.1E-04 

1 .2E-02 

7,054 

7,054 

VEPH1 

158,464,179 

rs924517 

1 .5E-02 

8.9E-04 

8.4E-03 

9,907 

9,907 

VEPH1 

158,474,086 

rsl 2633429 

2.5E-02 

9.6E-04 

8.9E-03 

17,521 

17,521 

VEPH1 

158,491,607 

rs2874616 

2.7E-02 

9.6E-04 

1 .3E-02 

35,260 

35,260 

VEPH1 

158,526,867 

rsl 0936081 

8.7E-03 

2.8E-04 

9.7E-03 

1,677 

1,677 

VEPH1 

158,528,544 

rsl 522392 

2.3E-02 

8.5E-04 

2.1E-02 

293,539 

VEPH1 

158,822,083 

rsl  1709879 

1 .7E-02 

7.6E-03 

5.8E-04 

2,018 

158,824,101 

rs9876844 

1 .7E-02 

9.2E-03 

5.6E-04 

1 ,657,233 

160,481,334 

rs9831067 

5.4E-02 

6.1E-04 

1 .6E-02 

194,103 

160,675,437 

rs9824310 

1 .7E-02 

3.4E-03 

9.4E-04 

1 ,799,688 

162,475,125 

rs9828189 

3.9E-02 

7.9E-02 

2.2E-04 

37,361 

37,361 

NMD3 

162,512,486 

rs9811833 

3.6E-02 

7.9E-02 

2.2E-04 

29,825 

29,825 

NMD3 

162,542,311 

rs336546 

3.0E-02 

7.6E-02 

2.5E-04 

5,683,665 

NMD3 

168,225,976 

rsl 949532 

1.7E-05 

9.4E-05 

1.3E-04 

216,996 

Pass 

Intergenic 

Intergenic 

168,442,972 

rsl  1923256 

4.1E-04 

1 .2E-02 

8.0E-03 

141,846 

168,584,818 

rsl 554568 

3.6E-04 

3.1E-03 

2.5E-02 

1,487 

1,487 

SERPINI2 

168,586,305 

rsl  171 4970 

6.9E-04 

5.0E-03 

2.9E-02 

7,778 

7,778 

SERPINI2 

168,594,083 

CG03028974 

9.9E-04 

8.4E-03 

2.8E-02 

8,944,150 

SERPINI2 

177,538,233 

rs2 128850 

4. IE-04 

8.7E-02 

2.8E-02 

659 

659 

LOC7301 68 

177,538,892 

rsl 31 00792 

8. IE-04 

1.4E-01 

2.4E-02 

22,262 

22,262 

LOC7301 68 

177,561,154 

rs2 170808 

9.0E-04 

1.2E-01 

7.6E-02 

8,056,977 

LOC7301 68 

185,618,131 

rsl 533678 

3.2E-04 

4.8E-02 

1.2E-01 

19,971 

185,638,102 

rs7429010 

9.8E-04 

5.1E-02 

2.2E-01 

200,280 

185,838,382 

rs7646910 

6.1E-03 

3.0E-02 

8.0E-04 

667 

185,839,049 

rs4686705 

8.4E-03 

3.8E-02 

8.0E-04 

1 ,445,273 

187,284,322 

rs7433760 

5.9E-02 

9.1E-04 

2.6E-03 

4,365 

4,365 

ETV5 

187,288,687 

rs4686730 

6.4E-02 

1.9E-04 

1.3E-03 

5,502 

5,502 

ETV5 

187,294,189 

rs6444106 

7.0E-02 

2.8E-04 

1.4E-03 

303,984 

ETV5 

187,598,173 

rs9851300 

2.4E-02 

1 .9E-02 

5.4E-04 

1 ,677,592 

189,275,765 

rs9813167 

6.5E-04 

3.1E-04 

1.5E-03 

458,465 

189,734,230 

rsl 7605260 

1.2E-03 

1 .7E-02 

8.8E-04 

9,012 

9,012 

LPP 

189,743,242 

rsl 426261 

5.0E-04 

1.1E-02 

3.1E-04 

155 

155 

LPP 

189,743,397 

rsl 426260 

1.0E-03 

1 .2E-02 

6.2E-04 

7,909 

7,909 

LPP 

189,751,306 

rs4263285 

1. IE-03 

2.1E-02 

3.4E-04 

1,716,531 

LPP 

191,467,837 

rsl 7427594 

2.3E-01 

7.0E-02 

5.0E-04 

4,543,255 

196,011,092 

rs4434099 

1.9E-03 

9.3E-04 

1.9E-03 

1 ,380,958 

197,392,050 

rs491 6483 

3.7E-02 

5.9E-04 

7.2E-02 

5,820,851 

rs6817935 

5.4E-03 

6.1E-04 

1.6E-01 

205,170 

6,026,021 

rsl  1733383 

1.0E-01 

8.0E-04 

4.9E-02 

1,712,997 

7,739,018 

rsl  55781 8 

2.4E-03 

3.6E-04 

7.1E-04 

3,075 

7,742,093 

rs929264 

4.6E-04 

1.8E-04 

4.0E-04 

847,788 

8,589,881 

rs31 03091 

4.4E-04 

8.3E-02 

1.0E-01 

7,458,654 

16,048,535 

rs4698487 

1.3E-04 

3.7E-03 

1. IE-03 

284,523 

16,333,058 

rsl  51 4645 

6.9E-04 

7.6E-05 

6.5E-04 

2,246 

Pass 

LDB2 

2,246 

LDB2 

16,335,304 

rs2645255 

5.8E-04 

9.3E-05 

8.9E-04 

580 

Pass 

LDB2 

580 

LDB2 

16,335,884 

rs2658509 

2.2E-02 

7.7E-04 

6.6E-03 

10,844 

10,844 

LDB2 

16,346,728 

rs283021 

1 .2E-02 

6.1E-04 

7.3E-03 

6,454 

6,454 

LDB2 

16,353,182 

rs883086 

5.3E-04 

4.7E-04 

1.4E-03 

1,314,537 

LDB2 

17,667,719 

rs9784574 

1.6E-01 

7.1E-03 

9.7E-04 

1,157,874 

18,825,593 

rsl 051 6333 

1.3E-03 

7.6E-04 

8.5E-03 

6,374,818 

25,200,411 

rs939352 

2. IE-04 

6.2E-03 

4.7E-02 

3,863,633 

29,064,044 

rsl  31 1891 4 

3.2E-03 

7.2E-04 

5.2E-02 

3,508,968 

32,573,012 

rs6531531 

1.0E-05 

1.7E-03 

1.9E-04 

3,408,030 

Pass 

Intergenic 

Intergenic 

35,981,042 

rsl  0001 947 

4.7E-02 

8.5E-04 

4.8E-02 

6,514,171 

42,495,213 

rs2575515 

8.2E-05 

6.9E-03 

1. IE-03 

14,833 

Pass 

LOC389207 

14,833 

LOC389207 

42,510,046 

rs4861224 

7.8E-04 

2.5E-02 

3.6E-03 

99,351 

LOC389207 

42,609,397 

rs4861237 

1.3E-03 

1.9E-04 

2.2E-02 

25,052 

25,052 

LOC389207 

42,634,449 

rs2345759 

6.5E-03 

3.8E-04 

1.0E-01 

20,976 

20,976 

LOC389207 

42,655,425 

rs992133 

4.0E-03 

2.3E-04 

6.4E-02 

64,587 

LOC389207 

42,720,012 

rsl 0938230 

8.6E-03 

5.2E-04 

1  .IE-01 

17,151 

17,151 

LOC389207 

42,737,163 

rs920153 

5.2E-03 

3.7E-04 

6.1E-02 

6,839 

6,839 

LOC389207 

42,744,002 

rsl 0938235 

7.2E-03 

3.0E-04 

9.8E-02 

5,412 

5,412 

LOC389207 

42,749,414 

rs720961 

9.0E-03 

3.6E-04 

1  .IE-01 

4,307 

4,307 

LOC389207 

42,753,721 

rs6855602 

9. IE-03 

3.6E-04 

1  .IE-01 

7,773 

7,773 

LOC389207 

42,761,494 

rsl  5951 87 

7.2E-03 

3.4E-04 

1.3E-01 

12,833,910 

LOC389207 

55,595,404 

rs4864949 

9.7E-04 

1.3E-01 

3.1E-02 

339,640 

55,935,044 

rs4864984 

1 .8E-02 

3.6E-04 

1.1E-02 

12,537 

55,947,581 

rs7664496 

3.5E-02 

8.3E-04 

3.4E-02 

5,144,943 

4 

61,092,524 

rsl  51 1 1 21 

1. IE-01 

6.7E-04 

1.3E-01 

17,800 

4 

61,110,324 

rsl 70901 88 

2.7E-02 

9.1E-04 

3.1E-02 

471 ,272 

4 

61,581,596 

rs6829909 

2.7E-04 

3.2E-03 

3.3E-03 

4,485,359 

4 

66,066,955 

rs46041 04 

6.2E-04 

2.5E-02 

2.5E-02 

7,892,583 

4 

73,959,538 

rsl  86403 

5.0E-05 

7.9E-03 

2.1E-04 

29,704 

Pass 

4 

73,989,242 

rs9 19047 

7.2E-04 

4.7E-02 

7.2E-04 

14,901,720 

4 

88,890,962 

rsl  1733008 

2.0E-02 

3.9E-02 

4.8E-04 

4,970,186 

4 

93,861,148 

rsl  01 55056 

9.7E-04 

1.6E-03 

1.8E-03 

64,919 

4 

93,926,067 

rsl 7259489 

7.1E-03 

4.1E-03 

6.8E-04 

265,258 

4 

94,191,325 

rs4491988 

4. IE-03 

5.5E-04 

4.6E-04 

634 

4 

94,191,959 

rs2870656 

1.7E-03 

5.3E-04 

5.8E-04 

3,190,528 

4 

97,382,487 

rsl 702521 9 

1.6E-04 

4.4E-03 

3.6E-03 

1,418 

4 

97,383,905 

rs7653937 

1.6E-04 

4.1E-03 

5.4E-03 

24,635 

4 

97,408,540 

rs50 17329 

1.0E-04 

3.6E-03 

4.4E-03 

10,351 

Pass 

4 

97,418,891 

rs6830746 

1.4E-04 

3.0E-03 

2.8E-03 

41,669 

4 

97,460,560 

rs2166016 

1.2E-04 

3.7E-03 

4.4E-03 

18,818 

4 

97,479,378 

rsl 605227 

3.0E-04 

7.5E-03 

7.8E-03 

5,946 

4 

97,485,324 

rs6832934 

1.9E-04 

4.7E-03 

6.6E-03 

109,981 

4 

97,595,305 

rs4699289 

8.2E-04 

1 .8E-02 

1 .2E-02 

9,423,460 

4 

107,018,765 

rsl 051 6529 

1.2E-03 

6.7E-04 

4.9E-02 

2,395,045 

4 

109,413,810 

rs2880544 

5.7E-03 

1.6E-03 

2.9E-04 

62,495 

4 

109,476,305 

rsl 7038826 

5. IE-03 

5.1E-04 

2.3E-04 

14,921 

4 

109,491,226 

rs917412 

8.2E-03 

8.7E-04 

1.1E-05 

1 ,243,784 

Pass 

4 

110,735,010 

rs6851009 

3.9E-02 

7.1E-04 

9.2E-03 

798,009 

4 

111,533,019 

rs2348205 

7.0E-03 

2.6E-03 

9.8E-04 

3,190,748 

4 

114,723,767 

rsl 0488894 

7.2E-05 

4.8E-02 

4.1E-02 

6,063,316 

Pass 

4 

120,787,083 

rs2389883 

6.4E-02 

8.3E-03 

2.7E-04 

372,050 

4 

121,159,133 

rsl 503534 

1.  IE-02 

8.2E-04 

1.7E-01 

504,018 

4 

121,663,151 

rs7441640 

2.3E-03 

9.4E-04 

1.4E-03 

179,815 

4 

121,842,966 

rsl  496276 

8.6E-04 

2.4E-03 

4.6E-02 

1,394,618 

4 

123,237,584 

rs3 108397 

9.4E-04 

2.1E-01 

1.2E-01 

8,780,187 

4 

132,017,771 

rsl  75021 37 

2.4E-03 

7.9E-03 

4.0E-05 

2,915,886 

Pass 

4 

134,933,657 

CG04022664 

5.6E-02 

2.6E-04 

1 .7E-02 

2,304,178 

4 

137,237,835 

rsl 344033 

2.7E-04 

3.2E-03 

3.7E-03 

3,527,821 

4 

140,765,656 

rsl  1945690 

5.1E-04 

5.9E-02 

4.5E-02 

10,635,063 

4 

151,400,719 

rsl 000321 5 

2.6E-06 

3.6E-04 

3.3E-04 

3,612,222 

Fail 

4 

155,012,941 

rsl  456385 

3.9E-04 

1 .6E-02 

1 .7E-02 

6,306 

4 

155,019,247 

rsl  051 7581 

3.3E-04 

1 .5E-02 

1 .8E-02 

145 

4 

155,019,392 

CG04025829 

4.2E-04 

1 .5E-02 

1 .8E-02 

24,389 

4 

155,043,781 

rsl 7299833 

1.6E-04 

7.1E-03 

9.6E-03 

7,858,154 

4 

162,901,935 

rsl 560925 

3.8E-03 

1 .4E-02 

9.0E-04 

2,331,689 

4 

165,233,624 

rs7659197 

1 .7E-02 

3.6E-04 

2.4E-01 

4,856 

4 

165,238,480 

rs6536795 

7.7E-03 

2.6E-04 

1.2E-01 

3,521,881 

4 

168,760,361 

rs7657818 

3.1E-04 

1.4E-03 

1 .2E-02 

1,082,132 

4 

169,842,493 

rs4692947 

1.1E-04 

3.1E-04 

5.6E-04 

958,398 

4 

170,800,891 

rsl 7627781 

1.5E-06 

2.4E-05 

3.4E-05 

1 ,572,837 

Fail 

4 

172,373,728 

rsl  03881 9 

6.3E-03 

7.7E-03 

4.5E-04 

5,332,245 

4 

177,705,973 

rsl  251 1856 

9.9E-04 

4.8E-05 

3.0E-02 

648,458 

Pass 

4 

178,354,431 

rsl 0446826 

8.1E-02 

3.3E-02 

4.5E-04 

35,439 

4 

178,389,870 

rs6821871 

1. IE-01 

4.6E-03 

8.3E-04 

4,078,606 

4 

182,468,476 

rsl  0520475 

8.9E-04 

9.1E-04 

2.5E-02 

1,122,957 

4 

183,591,433 

rsl  11 321 24 

4.0E-02 

2.3E-03 

7.3E-04 

2,574,340 

4 

186,165,773 

rsl  1737460 

7.3E-21 

4.8E-14 

2. IE-10 

2,872,754 

Fail 

4 

189,038,527 

rs4862838 

1.0E-04 

1 .0E-02 

6.9E-02 

1,031,117 

4 

190,069,644 

rsl  1737653 

3.0E-04 

9.5E-03 

3.4E-02 

5 

3,332,885 

rs4645337 

5.1E-04 

6.7E-03 

1.1E-02 

3,222,269 

5 

6,555,154 

rs6873584 

1 .0E-02 

9.4E-04 

1.0E-02 

394,103 

5 

6,949,257 

rs903867 

2.4E-03 

1 .5E-02 

8.7E-05 

11,635 

Pass 

5 

6,960,892 

rs903863 

1.3E-03 

1.1E-02 

4.7E-05 

27,095 

Pass 

5 

6,987,987 

rsl  11 341 94 

3.5E-03 

4.0E-02 

2.3E-04 

3,096,547 

5 

10,084,534 

rsl 00621 64 

5.6E-04 

1.3E-03 

1 .3E-02 

3,362,718 

5 

13,447,252 

rs985766 

2. IE-03 

5.6E-03 

4.7E-04 

2,032 

5 

13,449,284 

rs6875759 

2.9E-03 

6.4E-03 

4.3E-04 

29,820 

5 

13,479,104 

rs4702943 

2.7E-03 

7.2E-03 

4.0E-04 

12,354 

5 

13,491,458 

rsl  00641 10 

2.2E-03 

9.2E-03 

4.1E-04 

499,018 

5 

13,990,476 

rsl 7278234 

3.9E-04 

2.9E-03 

2.6E-03 

1 ,035,547 

5 

15,026,023 

rs835087 

1.3E-04 

8.9E-03 

1 .9E-02 

10,600 

5 

15,036,623 

rs835071 

1.6E-05 

2.7E-03 

3.0E-02 

120 

Pass 

5 

15,036,743 

rs835070 

5.9E-04 

2.4E-02 

8.8E-02 

2,783,566 

5 

17,820,309 

rsl 0941 538 

8.0E-04 

8.3E-02 

3.0E-01 

106,574 

5 

17,926,883 

rsl  5421 28 

8.8E-04 

6.3E-02 

1.3E-01 

225,841 

5 

18,152,724 

rsl 6887248 

1.5E-04 

4.2E-02 

3.8E-03 

417 

5 

18,153,141 

rsl 766081 5 

2.1E-04 

5.7E-02 

7.6E-03 

1 ,376,300 

5 

19,529,441 

rsl 486849 

2.4E-02 

7.6E-04 

2.2E-01 

933 

5 

19,530,374 

rsl 721 6535 

1.1E-02 

6.7E-04 

9.6E-02 

781 

5 

19,531,155 

rsl  486851 

1.1E-02 

7.2E-04 

1.3E-01 

30,693 

5 

19,561,848 

rs3811865 

8.4E-03 

8.2E-04 

1.5E-01 

5,702,136 

5 

25,263,984 

rs39 13960 

7.2E-03 

6.9E-04 

6.9E-04 

7,397,253 

5 

32,661 ,237 

rs864674 

3.1E-03 

7.1E-04 

2.5E-03 

4,102,149 

5 

36,763,386 

rs36650 

7.9E-03 

6.7E-04 

1 .3E-02 

17,290,603 

5 

54,053,989 

rs3886214 

7.1E-03 

3.0E-02 

2.7E-04 

470,687 

PALLD  169,654 


LOC728019 


Intergenic 

CAMK2D 

Intergenic 

LRBA 


CLCN3 

Intergenic 

HELT 

Intergenic 

Intergenic 


LOC4021 98 


29,704  LOC728019 

LOC728019 


1,418 

Intergenic 

24,635 

Intergenic 

10,351 

Intergenic 

41,669 

Intergenic 

18,818 

Intergenic 

5,946 

Intergenic 

Intergenic 


CAMK2D 


Intergenic 


6,306 

145 

24,389 


LRBA 

SFRP2 

SFRP2 

SFRP2 

SFRP2 


PALLD 

CLCN3 


Intergenic 


HELT 


11,635 

Intergenic 

27,095 

Intergenic 

Intergenic 

2,032 

Intergenic 

29,820 

Intergenic 

12,354 

Intergenic 

Intergenic 

10,600 

LOC4021 98 

120 

LOC4021 98 

LOC4021 98 

933 

CDH18 

781 

CDH18 

30,693 

CDH18 

CDH18 

5 

54,524,676 

rsl  21 1 0186 

2.0E-02 

1.6E-04 

2.4E-02 

863,851 

5 

55,388,527 

rsl 0042443 

2.2E-04 

1.4E-05 

1.4E-04 

2,299 

Pass 

5 

55,390,826 

rsl 0044080 

1. IE-03 

3.9E-05 

1.2E-03 

502,606 

Pass 

5 

55,893,432 

rs4301182 

2.8E-03 

7.4E-05 

7.9E-03 

4,995,848 

Pass 

5 

60,889,280 

rsl  31 77031 

5.1E-05 

1.7E-03 

2.7E-03 

6,453,252 

Pass 

5 

67,342,532 

rs281441 

4.0E-02 

1.6E-04 

2.3E-02 

10,759,525 

5 

78,102,057 

rsl 0068924 

6.3E-02 

3.1E-02 

5.8E-04 

3,930,423 

5 

82,032,480 

rsl 827389 

1.2E-01 

4.3E-04 

1 .8E-02 

7,279,762 

5 

89,312,242 

rsl 047431 2 

4.0E-03 

1.4E-04 

7.5E-04 

9,768 

5 

89,322,010 

rsl 560208 

2.4E-03 

1.1E-04 

2.1E-04 

78,065 

5 

89,400,075 

rs8 18944 

3.9E-03 

2.5E-04 

8.8E-04 

10,846 

5 

89,410,921 

rs256515 

3. IE-03 

2.6E-04 

4.9E-04 

373 

5 

89,411,294 

rsl  81 930 

4.5E-03 

2.8E-04 

7.1E-04 

13,721 

5 

89,425,015 

rsl  57559 

2. IE-03 

1.1E-04 

2.0E-04 

15,643 

5 

89,440,658 

rs924628 

3.5E-03 

3.5E-04 

3.4E-04 

10,555,286 

5 

99,995,944 

rs32282 

3.6E-02 

6.3E-03 

8.0E-04 

164,532 

5 

100,160,476 

rs32477 

2.5E-01 

3.7E-03 

5.7E-04 

9,199,107 

5 

109,359,583 

rs36789 

5.1E-04 

2.7E-02 

1  .IE-01 

52,622 

5 

109,412,205 

rsl  71 62591 

4.3E-04 

3.7E-03 

2.6E-03 

1,648,803 

5 

111,061,008 

rsl  71 33647 

8.0E-04 

4.2E-02 

7.1E-01 

874,849 

5 

111,935,857 

rs9790990 

9.7E-04 

1 .8E-02 

1 .0E-02 

6,760,092 

5 

118,695,949 

rs986649 

1 .9E-02 

2.6E-03 

5.6E-04 

328,106 

5 

119,024,055 

rs780428 

6.2E-02 

1.0E-03 

6.4E-03 

51,671 

5 

119,075,726 

rsl  1241 51 5 

1.9E-02 

9.2E-04 

3.3E-02 

1 ,866,497 

5 

120,942,223 

rs4307116 

7.3E-03 

1.9E-04 

5.8E-02 

1 ,563,550 

5 

122,505,773 

rs467069 

6.7E-03 

7.8E-05 

3.3E-03 

6,015,793 

Pass 

5 

128,521,566 

rs7726142 

1.5E-01 

1.0E-01 

9.3E-04 

6,451,415 

5 

134,972,981 

rs771 5094 

1 .2E-02 

1 .2E-02 

9.4E-04 

8,261 ,520 

5 

143,234,501 

rsl 0075045 

8.0E-04 

5.5E-03 

1 .7E-02 

10,838 

5 

143,245,339 

rs9324934 

5.4E-04 

6.7E-03 

2.2E-02 

89,330 

5 

143,334,669 

rs3 12622 

1.6E-04 

4.2E-02 

2.9E-02 

1,159,814 

5 

144,494,483 

rs77 10868 

2.0E-02 

8.7E-04 

2.0E-02 

1 ,682,578 

5 

146,177,061 

rsl 432892 

2.7E-03 

9.4E-04 

5.6E-04 

11,277,493 

5 

157,454,554 

rsl 744344 

1.9E-04 

3.6E-02 

7.1E-03 

618,246 

5 

158,072,800 

rs37341 05 

2.6E-03 

6.2E-04 

2.4E-02 

5,062 

5 

158,077,862 

rs41 47383 

2.3E-03 

8.3E-04 

2.5E-02 

49,129 

5 

158,126,991 

rsl 006781 3 

7.8E-04 

2.7E-03 

1.8E-03 

752,729 

5 

158,879,720 

rs2421051 

3.1E-02 

1 .0E-02 

9.7E-04 

151,381 

5 

159,031,101 

rs461 6877 

7.6E-02 

6.5E-04 

2.0E-02 

147,200 

5 

159,178,301 

rsl  70571 61 

5.9E-04 

2.3E-02 

9.1E-05 

9,150,496 

Pass 

5 

168,328,797 

rsl 0053858 

6.1E-04 

3.7E-03 

1.3E-03 

206 

5 

168,329,003 

rsl 0053973 

4.0E-04 

1.1E-03 

1.4E-04 

446,983 

5 

168,775,986 

rs929758 

2.2E-04 

4.9E-05 

8.3E-05 

57 

Pass 

5 

168,776,043 

rs929759 

1.8E-04 

5.4E-05 

5.2E-05 

19,967 

Pass 

5 

168,796,010 

rs39797 

2.8E-04 

1.7E-04 

1.6E-04 

3,411,859 

5 

172,207,869 

rs792965 

5.0E-06 

4.1E-05 

1.1E-04 

2,339,895 

Pass 

5 

174,547,764 

rs2381985 

4.6E-02 

2.5E-01 

4.4E-05 

131 

Pass 

5 

174,547,895 

rs2381986 

1. IE-01 

4.2E-01 

2.2E-04 

14,338 

5 

174,562,233 

rs6556193 

4.4E-02 

5.6E-02 

3.8E-04 

6 

374,457 

rs950286 

5.2E-05 

6.7E-06 

4.8E-03 

12,749 

Pass 

6 

387,206 

rs4959880 

3.8E-04 

1.1E-04 

1 .0E-02 

7,277 

6 

394,483 

rsl  321 4605 

1.6E-03 

8.7E-04 

1.3E-01 

569,823 

6 

964,306 

rs845887 

7.1E-02 

2.9E-04 

3.3E-02 

2,165 

6 

966,471 

rs845882 

1.0E-01 

9.6E-04 

3.0E-02 

3,762,957 

6 

4,729,428 

rsl 71 38843 

6.8E-05 

1 .9E-02 

1 .3E-02 

1,908,891 

Pass 

6 

6,638,319 

rs2235493 

6.6E-04 

2.6E-02 

2.2E-02 

561,321 

6 

7,199,640 

rs27 14342 

1 .4E-01 

5.5E-02 

4.4E-04 

1,787,791 

6 

8,987,431 

rs9328516 

4.3E-03 

7.3E-05 

5.4E-04 

20,029 

Pass 

6 

9,007,460 

rs2093745 

1 .5E-02 

1.5E-04 

2. IE-03 

6,167 

6 

9,013,627 

rs2209123 

1.3E-02 

1.5E-04 

2.2E-03 

1 ,482,456 

6 

10,496,083 

rs4527693 

1.4E-03 

7.9E-04 

6.7E-04 

5,020,424 

6 

15,516,507 

rs2237150 

2.8E-01 

2.9E-01 

9.3E-04 

21,203 

6 

15,537,710 

rs4085876 

1.3E-01 

4.1E-01 

5.2E-04 

7,659 

6 

15,545,369 

rs2282827 

8.9E-03 

6.2E-02 

5.1E-04 

883,749 

6 

16,429,118 

rs2038018 

9.9E-04 

1 .3E-02 

2.9E-02 

688 

6 

16,429,806 

rs2237224 

8.3E-04 

9.5E-03 

1 .9E-02 

3,441 ,826 

6 

19,871,632 

rs93501 78 

1.6E-04 

2.4E-04 

1.6E-04 

1,331,085 

6 

21,202,717 

rs6939622 

4.6E-02 

3.5E-03 

2.7E-04 

2,847,139 

6 

24,049,856 

rs9466992 

9.7E-07 

3.5E-05 

5.6E-04 

3,831,955 

Pass 

6 

27,881,811 

rs200481 

4.4E-04 

2.6E-04 

9.2E-02 

34,360 

6 

27,916,171 

rs201002 

1.6E-04 

7.4E-05 

4.9E-02 

5,502 

Pass 

6 

27,921 ,673 

rs200995 

1.9E-04 

7.4E-05 

4.9E-02 

1,800 

Pass 

6 

27,923,473 

rs200991 

2.6E-04 

2.6E-04 

4.3E-02 

154,537 

6 

28,078,010 

rsl  49946 

1.8E-03 

9.1E-04 

5.1E-02 

41,621 

6 

28,119,631 

rs202906 

2.8E-04 

1.0E-04 

3.5E-02 

9,617 

6 

28,129,248 

rs203887 

9.8E-04 

3.2E-04 

2.6E-02 

320 

6 

28,129,568 

rs203888 

9.7E-05 

9.1E-05 

1 .5E-02 

2,162 

Pass 

6 

28,131,730 

rs9393881 

9.6E-04 

6.1E-04 

3.9E-02 

183,612 

6 

28,315,342 

rsl  150709 

6.7E-04 

6.9E-03 

2.9E-01 

100,671 

6 

28,416,013 

rsl  7301 128 

8.5E-04 

2.3E-03 

1.3E-01 

1,102 

6 

28,417,115 

rsl  11 9211 

2.2E-04 

5.5E-04 

5.2E-02 

224,810 

IL6ST  2,299  IL6ST 

IL6ST  IL6ST 

Intergenic  Intergenic 

ZSWIM6  ZSWIM6 


10,846 

373 

13,721 

15,643 


Intergenic 

Intergenic 

Intergenic 

Intergenic 

Intergenic 


PRDM6  PRDM6 


5,062  EBF1 

49,129  EBF1 

EBF1 

Intergenic  Intergenic 

Intergenic  57  Intergenic 

Intergenic  19,967  Intergenic 

Intergenic 

ERGIC1  ERGIC1 

Intergenic  131  Intergenic 

14,338  Intergenic 

Intergenic 

IRF4  12,749  IRF4 

7,277  IRF4 

IRF4 

CDYL  CDYL 

LOC389365  20,029  LOC389365 

6,167  LOC389365 

LOC389365 

21,203  JARID2 

7,659  JARID2 

JARID2 


LOC645385 

34,360 

Histone  Gene  Cluster  5,502 
Histone  Gene  Cluster  1 ,800 

41,621 

9,617 

320 

Olfactory  Receptor  2,162 


LOC645385 
Histone  Gene  Cluster 
Histone  Gene  Cluster 
Histone  Gene  Cluster 
Histone  Gene  Cluster 
Olfactory  Receptor 
Olfactory  Receptor 
Olfactory  Receptor 
Olfactory  Receptor 
Olfactory  Receptor 


6 

28,641 ,925 

rs4 18092 

9.3E-05 

6.5E-05 

2.8E-02 

41 ,226 

Pass 

6 

28,683,151 

rsl 474986 

1.8E-04 

1.3E-03 

2.9E-02 

159,504 

6 

28,842,655 

rsl 233604 

1.0E-03 

2.8E-04 

5.0E-02 

61,180 

6 

28,903,835 

rs3 135296 

8.0E-05 

5.6E-05 

1 .7E-02 

235,286 

Pass 

6 

29,139,121 

rs3117143 

1.8E-04 

4.5E-05 

2.7E-02 

26,497 

Pass 

6 

29,165,618 

rs3 129788 

6.0E-05 

4.2E-05 

2.3E-02 

15,455 

Pass 

6 

29,181,073 

rs3131091 

3.1E-04 

1.2E-03 

9.8E-02 

94,481 

6 

29,275,554 

rs3116830 

3.4E-05 

4.1E-05 

1 .2E-02 

10,939 

Pass 

6 

29,286,493 

rs2206040 

5.2E-04 

1.9E-03 

1.2E-01 

16,497 

6 

29,302,990 

rs3130813 

7.8E-05 

3.6E-04 

8.7E-03 

30,784 

Pass 

6 

29,333,774 

rs3117330 

7.5E-04 

5.5E-03 

6.3E-02 

30,625 

6 

29,364,399 

rsl  8841 23 

9.3E-04 

5.2E-03 

4.2E-02 

4,011 

6 

29,368,410 

rs3117425 

2.0E-05 

5.1E-06 

2.9E-03 

82,344 

Pass 

6 

29,450,754 

rs3749971 

7.2E-05 

1.0E-05 

7.7E-03 

22,176 

Pass 

6 

29,472,930 

rs2073151 

7.8E-04 

1.9E-03 

5.9E-03 

11,434 

6 

29,484,364 

rs406511 

4.9E-06 

2.8E-07 

4.4E-04 

39,079 

Pass 

6 

29,523,443 

rs2523443 

1. IE-05 

6.3E-06 

1.3E-03 

28,657 

Pass 

6 

29,552,100 

rsl 233495 

9.2E-03 

8.2E-04 

9.4E-02 

102,678 

6 

29,654,778 

rsl 233396 

3.0E-04 

1.4E-04 

8.0E-03 

1,290 

6 

29,656,068 

rs926552 

9.4E-04 

4.1E-04 

3.2E-03 

438,235 

6 

30,094,303 

rs31 15631 

6.6E-06 

6.9E-06 

2.6E-03 

106,813 

Pass 

6 

30,201,116 

rs2517592 

8.8E-03 

6.5E-04 

1 .9E-02 

10,223 

6 

30,211,339 

rsl  541 269 

5.9E-06 

1.4E-06 

4.5E-04 

49,107 

Pass 

6 

30,260,446 

CG06006774 

1.4E-06 

2.0E-06 

4.4E-04 

5,660 

Pass 

6 

30,266,106 

rs2517618 

5.9E-04 

1.0E-04 

5.9E-03 

11,200 

6 

30,277,306 

rs2517611 

1.8E-03 

9.1E-04 

6.2E-03 

62,502 

6 

30,339,808 

rs3094070 

1. IE-03 

6.8E-04 

2.4E-02 

421 

6 

30,340,229 

rs3 130404 

9.3E-04 

5.3E-04 

2.0E-02 

1,902 

6 

30,342,131 

rs3 130405 

7.7E-04 

6.7E-04 

2.9E-02 

516 

6 

30,342,647 

rs2844764 

1.6E-03 

7.6E-04 

1 .6E-02 

86,389 

6 

30,429,036 

rs3 132649 

6.4E-08 

6.5E-06 

2.1E-04 

132 

Pass 

6 

30,429,168 

rs3094061 

3.6E-06 

2.3E-05 

1.2E-03 

147 

Pass 

6 

30,429,315 

rs3 130374 

7.2E-06 

4.3E-05 

8.6E-04 

146 

Pass 

6 

30,429,461 

rs3094627 

5.0E-07 

7.5E-06 

5.2E-04 

6,710 

Pass 

6 

30,436,171 

rs31 30351 

1.2E-06 

2.5E-06 

1.8E-04 

165 

Pass 

6 

30,436,336 

rs3 130352 

1.1E-06 

1. IE-06 

1.2E-04 

1,609 

Pass 

6 

30,437,945 

rs3094057 

4.8E-06 

6.6E-06 

2.6E-04 

2,180 

Pass 

6 

30,440,125 

rs3094055 

4.3E-05 

7.1E-04 

2.1E-03 

409 

Pass 

6 

30,440,534 

rsl  01 2411 

1. IE-05 

6.8E-05 

1. IE-03 

9,957 

Pass 

6 

30,450,491 

rs3 132636 

9.4E-04 

1.6E-03 

3.9E-03 

739 

6 

30,451,230 

rs3129819 

7.2E-04 

1.2E-03 

4.5E-03 

3,988 

6 

30,455,218 

rs970269 

6.8E-04 

9.5E-04 

3.9E-03 

1,962 

6 

30,457,180 

rs3094623 

8.1E-04 

9.8E-04 

5.1E-03 

9,390 

6 

30,466,570 

rs3094050 

5.6E-11 

7.1E-09 

7.0E-06 

366 

Pass 

6 

30,466,936 

rs3094703 

4.0E-10 

2. IE-08 

1.7E-05 

4,394 

Pass 

6 

30,471,330 

rs3094034 

7.3E-07 

1.5E-06 

7.7E-05 

88,553 

Pass 

6 

30,559,883 

rs3094694 

1.4E-07 

1.3E-07 

6.7E-05 

176,178 

Pass 

6 

30,736,061 

rs3 130000 

3.3E-05 

1.3E-03 

1.6E-03 

65,734 

Pass 

6 

30,801 ,795 

rs3095329 

2.6E-08 

7.3E-07 

8.3E-06 

3,631 

Pass 

6 

30,805,426 

rs30941 27 

3.3E-09 

1.0E-07 

1.3E-06 

29,492 

Pass 

6 

30,834,918 

rs3095340 

1.8E-04 

1.1E-06 

4.8E-05 

1,421 

Pass 

6 

30,836,339 

rs30941 22 

1.2E-03 

4.8E-04 

1.5E-03 

18,856 

6 

30,855,195 

rsl 0947091 

6.4E-03 

2.7E-04 

2.3E-04 

7,324 

6 

30,862,519 

rsl 252741 5 

1.6E-03 

1.5E-04 

6.7E-05 

3,808 

Pass 

6 

30,866,327 

rsl  1758688 

1.7E-03 

1.4E-04 

5. IE-05 

1,677 

Pass 

6 

30,868,004 

rs3131050 

1.9E-09 

2. IE-09 

8.8E-07 

3,266 

Pass 

6 

30,871 ,270 

rs3131060 

2.5E-09 

4.1E-09 

1.2E-06 

3,654 

Pass 

6 

30,874,924 

rs4587207 

5.0E-02 

1.6E-03 

9.4E-04 

5,433 

6 

30,880,357 

rs30941 23 

2.3E-05 

1.9E-04 

3.7E-03 

936 

Pass 

6 

30,881 ,293 

rs47 13376 

3.3E-04 

6.7E-02 

2.5E-02 

16,294 

6 

30,897,587 

rsl 21 95469 

5.2E-04 

8.0E-02 

2.4E-02 

13,646 

6 

30,911,233 

rs3 130649 

8.5E-04 

4.9E-05 

4.8E-05 

14,612 

Pass 

6 

30,925,845 

rs3095350 

2. IE-03 

7.3E-05 

3.9E-05 

77,814 

Pass 

6 

31,003,659 

rs4711247 

4.8E-03 

1.3E-03 

6.9E-04 

6,215 

6 

31,009,874 

rs3131785 

1.8E-04 

3.1E-03 

3.8E-02 

2,053 

6 

31,011,927 

rs3131784 

1.8E-04 

3.7E-03 

4.9E-02 

36,439 

6 

31,048,366 

rs2530710 

1.9E-06 

1.5E-05 

8.6E-05 

32,202 

Pass 

6 

31,080,568 

rsl  63471 7 

2.8E-04 

1.3E-05 

4.9E-04 

276 

Pass 

6 

31,080,844 

rsl  63471 8 

1  .IE-06 

5.4E-08 

3.9E-05 

2,784 

Pass 

6 

31,083,628 

rsl  632854 

2.1E-04 

1.2E-05 

4.2E-04 

37,892 

Pass 

6 

31,121,520 

rs2517538 

3.4E-04 

1.2E-05 

3. IE-03 

4,907 

Pass 

6 

31,126,427 

rs2523865 

2. IE-09 

3.3E-10 

5.1E-07 

7,265 

Pass 

6 

31,133,692 

rs2517524 

4.6E-03 

1 .9E-02 

2.3E-04 

32,627 

6 

31,166,319 

rs3 130544 

6.2E-17 

1.7E-14 

5.4E-10 

22,131 

Pass 

6 

31,188,450 

rsl 265052 

2.6E-06 

6.5E-07 

6. IE-05 

1,367 

Pass 

6 

31,189,817 

rs3 130975 

9.3E-08 

2.0E-07 

2.2E-05 

1,975 

Pass 

6 

31,191,792 

rs31 30981 

5.8E-06 

1 .8  E -05 

1.6E-04 

3,320 

Pass 

6 

31,195,112 

rs3095324 

1.1E-07 

1.2E-07 

1.8E-05 

2,498 

Pass 

6 

31,197,610 

rs3095314 

2.9E-06 

2.9E-06 

1.2E-04 

7,552 

Pass 

6 

31,205,162 

rs3 130558 

5.3E-06 

6.5E-06 

7.3E-05 

4,491 

Pass 

6 

31,209,653 

rs3 130564 

4.6E-12 

1.0E-10 

1.4E-08 

28,404 

Pass 

HLA-E 


DPCR1 

DPCR1 

DPCR1 

DPCR1 


30,565,250 


31 ,027,538 
31,027,538 
31,027,538 
31 ,027,538 


30,569,077 


31,029,977 

31,029,977 

31,029,977 

31,029,977 


HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 

HLA  Region  HLA-E 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region  DPCR1 

HLA  Region  DPCR1 

HLA  Region  DPCR1 

HLA  Region  DPCR1 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 


31,238,057 

rs2073723 

5.5E-06 

1.2E-07 

3.3E-04 

3,431 

Pass 

31 ,241 ,488 

rs21 06074 

7.0E-08 

3.0E-08 

4.9E-06 

2,197 

Pass 

31,243,685 

rs9263804 

6. IE-06 

1.5E-07 

2.7E-04 

747 

Pass 

31 ,244,432 

rs31 30501 

4.4E-06 

9.7E-08 

2.6E-04 

261 

Pass 

31 ,244,693 

rs3 132524 

6.0E-06 

1.4E-07 

1.5E-04 

6,938 

Pass 

31,251,631 

rs887465 

8.6E-05 

1.7E-05 

1.9E-03 

12,133 

Pass 

31 ,263,764 

rs1265181 

3. IE-06 

4.6E-06 

7.4E-04 

5,425 

Pass 

31,269,189 

rs3095238 

2.5E-04 

2.8E-04 

1.9E-03 

35,461 

31,304,650 

rs3 130944 

9.1E-05 

2.0E-05 

2.2E-03 

8,946 

Pass 

HLA-C 

31,313,596 

rs3130713 

4.9E-05 

1.7E-05 

3. IE-03 

1,303 

Pass 

HLA-C 

31,314,899 

rs2394894 

9.3E-06 

4.0E-06 

3.6E-04 

59 

Pass 

HLA-C 

31,314,958 

rs2394895 

3.6E-05 

9.0E-06 

4.7E-04 

26,531 

Pass 

HLA-C 

31,341,489 

rs2248880 

1.3E-04 

1.1E-04 

2.9E-03 

9,660 

HLA-C 

31,351,149 

rs3 132486 

2.2E-16 

4.2E-12 

1.8E-10 

2,364 

Pass 

HLA-C 

31,353,513 

rsl 0456057 

8.8E-03 

3.1E-03 

6.3E-04 

202 

HLA-C 

31,353,715 

rs6906846 

3.8E-09 

1.2E-07 

2.7E-06 

85 

Pass 

HLA-C 

31,353,800 

rs2524067 

4.8E-16 

2.8E-13 

2.6E-09 

1,246 

Pass 

HLA-C 

31,355,046 

rs7382297 

6.0E-04 

3.6E-04 

3.0E-03 

89,183 

HLA-C 

31 ,444,229 

rs2523535 

2.5E-05 

2.1E-06 

1.7E-04 

99 

Pass 

HLA-B 

31,444,328 

rs2523534 

2.2E-06 

1.0E-06 

2.9E-04 

2,737 

Pass 

HLA-B 

31,447,065 

rs2596437 

4.8E-06 

6.5E-06 

6.2E-04 

4,518 

Pass 

HLA-B 

31,451,583 

rs5025315 

2.0E-06 

3.0E-06 

2.6E-04 

258 

Pass 

HLA-B 

31,451,841 

rs5022119 

8.5E-07 

2.1E-06 

2. IE-04 

411 

Pass 

HLA-B 

31,452,252 

rs2523638 

4.8E-08 

8.0E-06 

6.0E-04 

21 

Pass 

HLA-B 

31 ,452,273 

rs3997982 

3.7E-06 

2.5E-06 

2.9E-04 

3,728 

Pass 

HLA-B 

31,456,001 

rs2596571 

1.0E-06 

5.0E-05 

1.4E-03 

3,013 

Pass 

HLA-B 

31,459,014 

rs2523485 

1.5E-07 

6.4E-06 

4.2E-04 

1,845 

Pass 

HLA-B 

31,460,859 

rs9266774 

5.1E-04 

3.1E-02 

1 .6E-02 

7,215 

HLA-B 

31,468,074 

rs2596517 

3.7E-06 

3.3E-06 

4.1E-04 

40,840 

Pass 

HLA-B 

31,508,914 

rs2394999 

3.7E-02 

1 .8E-02 

4.6E-04 

12,026 

MICA 

31,520,940 

rs2596464 

8.5E-08 

1.7E-06 

2.4E-06 

12,538 

Pass 

MICA 

31,533,478 

rs3131621 

1.0E-06 

2.6E-06 

8.3E-06 

11,082 

Pass 

MICA 

31,544,560 

rs2844507 

7.1E-06 

9.1E-05 

2.9E-03 

10,129 

Pass 

MICB 

31,554,689 

rs2248459 

7.5E-15 

4.1E-11 

2.0E-09 

86 

Pass 

MICB 

31,554,775 

rs2248462 

1.0E-04 

4.7E-04 

6.4E-05 

1,737 

Pass 

MICB 

31,556,512 

rs2248617 

1.8E-14 

3.4E-11 

1.3E-09 

329 

Pass 

MICB 

31,556,841 

rs3749946 

2.5E-04 

2.3E-02 

1. IE-03 

114 

MICB 

31,556,955 

rs3099844 

2.6E-18 

6.9E-15 

2.0E-10 

105 

Pass 

MICB 

31,557,060 

rs2904776 

1.3E-04 

8.6E-04 

2.1E-03 

188 

MICB 

31,557,248 

rs2516422 

5. IE-06 

6.5E-05 

1.1E-05 

2,207 

Pass 

MICB 

31,559,455 

rs2905747 

1.6E-04 

2.5E-04 

1.2E-02 

8,667 

MICB 

31,568,122 

rs2395034 

1.3E-04 

6.9E-04 

4.0E-05 

125,951 

Pass 

MICB 

31,694,073 

rs2736177 

3.7E-10 

3.2E-08 

6.6E-06 

16,873 

Pass 

LTA 

31,710,946 

rsl 046089 

6. IE-15 

3.5E-11 

1.2E-08 

8,810 

Pass 

LST1 

31,719,756 

rs760293 

1.6E-08 

3.0E-06 

1.5E-04 

1,962 

Pass 

AIF1 

31,721,718 

rs3 130048 

1.4E-09 

4.7E-07 

7.9E-04 

4,382 

Pass 

AIF1 

31,726,100 

rs805301 

6.6E-15 

1.2E-11 

1.9E-08 

25,401 

Pass 

AIF1 

31,751,501 

rs9378164 

1 .4E-02 

4.8E-04 

2.4E-03 

77,511 

BAT2 

31,829,012 

rs3131379 

6.0E-17 

1.7E-13 

2.4E-09 

73,559 

Pass 

LY6G6C 

31,902,571 

rs2763979 

4.2E-09 

1.4E-08 

7.3E-07 

126,095 

Pass 

HSPA1 B 

32,028,666 

rs550513 

8.1E-04 

6.9E-04 

1 .2E-02 

12,474 

32,041,140 

rs406936 

3.3E-08 

1.9E-05 

1.8E-04 

357 

Pass 

32,041 ,497 

rs492899 

1.4E-05 

1.0E-04 

6.5E-03 

117,025 

Pass 

32,158,522 

rs3 130287 

2.6E-11 

1.5E-08 

5.7E-08 

9,323 

Pass 

32,167,845 

rsl  150753 

1.9E-21 

7.5E-15 

2.5E-10 

101,563 

Pass 

32,269,408 

rs204990 

2.6E-08 

1.8E-07 

4.9E-06 

422 

Pass 

NOTCH4 

32,269,830 

rs204989 

1.2E-06 

1.2E-06 

9.3E-05 

3,592 

Pass 

NOTCH4 

32,273,422 

rs2071278 

1.2E-11 

3. IE-09 

1.5E-06 

14,702 

Pass 

NOTCH4 

32,288,124 

rs3131294 

9.0E-13 

9.0E-09 

3.2E-08 

8,886 

Pass 

NOTCH4 

32,297,010 

rs4 15929 

8.0E-04 

9.7E-06 

2. IE-06 

10,112 

Pass 

NOTCH4 

32,307,122 

rs377763 

1.7E-18 

6.6E-14 

8.2E-10 

9,180 

Pass 

NOTCH4 

32,316,302 

rs424232 

4.9E-06 

9.4E-05 

1 .2E-02 

4,728 

Pass 

NOTCH4 

32,321 ,030 

rs9267954 

1.2E-04 

3.6E-03 

1.9E-04 

20,834 

NOTCH4 

32,341 ,864 

rs7767325 

5.9E-04 

2.9E-03 

1 .7E-02 

9,243 

NOTCH4 

32,351,107 

rsl 559873 

1.2E-04 

3.4E-03 

8.9E-03 

26 

32,351,133 

rsl 559874 

1.4E-04 

2.2E-03 

1 .0E-02 

37,307 

32,388,440 

rs9268207 

5.0E-05 

8.1E-04 

8.5E-03 

1,048 

Pass 

32,389,488 

rsl  01 8433 

1.7E-04 

2.5E-03 

1 .4E-02 

24,469 

32,413,957 

rs3 129900 

8.7E-15 

2.3E-09 

5.9E-10 

9,675 

Pass 

32,423,632 

rs9 10050 

4.6E-06 

3.1E-04 

1.6E-03 

73 

Pass 

BTNL2 

32,423,705 

rs9 10049 

8.2E-18 

7.7E-12 

7.9E-11 

10,108 

Pass 

BTNL2 

32,433,813 

rs2894249 

1.9E-18 

5.8E-12 

8.6E-11 

10,292 

Pass 

BTNL2 

32,444,105 

rs3 129932 

1.6E-17 

1.6E-11 

8.6E-11 

34 

Pass 

BTNL2 

32,444,139 

rs3 129933 

1.3E-06 

2.5E-05 

3.4E-05 

26 

Pass 

BTNL2 

32,444,165 

rs3 129934 

3.8E-15 

7.2E-10 

2.3E-10 

5,286 

Pass 

BTNL2 

32,449,451 

rs9268403 

1.9E-08 

7.0E-08 

3.6E-08 

4,216 

Pass 

BTNL2 

32,453,667 

rs2894254 

6.2E-20 

2.7E-15 

6.9E-11 

17,838 

Pass 

BTNL2 

32,471 ,505 

rs2076533 

2.0E-04 

4.3E-04 

1.8E-05 

289 

Pass 

BTNL2 

32,471 ,794 

rs2076530 

1.7E-04 

3.6E-04 

2.7E-05 

28 

Pass 

BTNL2 

32,471 ,822 

rs9268480 

7.3E-09 

3.3E-08 

1.6E-08 

9,341 

Pass 

BTNL2 

32,481,163 

rsl 0947260 

3.0E-05 

2.6E-05 

9.9E-06 

47 

Pass 

BTNL2 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,344,508 

31,347,834  Strong 

Majorcan,  Spanish 

HLA  Region 

HLA-C 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31 ,429,628 

31,432,914  Mixed 

Brazilian,  Spanish 

HLA  Region 

HLA-B 

31,479,350 

31,491,069  Strong 

Eastern  India,  Basques,  Korean,  Swedish  Caucasians,  Czec  HLA  Region 

MICA 

31,479,350 

31,491,069  Strong 

Eastern  India,  Basques,  Korean,  Swedish  Caucasians,  Czec  HLA  Region 

MICA 

31,479,350 

31,491,069  Strong 

Eastern  India,  Basques,  Korean,  Swedish  Caucasians,  Czec  HLA  Region 

MICA 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,573,945 

31,586,880 

HLA  Region 

MICB 

31,648,072 

31,650,077  Strong 

Caucasian,  Finnish,  Japanese,  North  Indian  Asian 

HLA  Region 

LTA 

31,661,950 

31,664,665  Mixed 

HLA  Region 

LST1 

31,691,012 

31,692,777  Weak 

HLA  Region 

AIF1 

31,691,012 

31,692,777  Weak 

HLA  Region 

AIF1 

31,691,012 

31,692,777  Weak 

HLA  Region 

AIF1 

31,696,576 

31,713,516  Strong 

Spanish 

HLA  Region 

BAT2 

31 ,794,404 

31,797,489 

HLA  Region 

LY6G6C 

31,903,503 

31,906,010  Weak 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HSPA1 B 

32,270,598 

32,299,822 

HLA  Region 

NOTCH4 

32,270,598 

32,299,822 

HLA  Region 

NOTCH4 

32,270,598 

32,299,822 

HLA  Region 

NOTCH4 

32,270,598 

32,299,822 

HLA  Region 

NOTCH4 

32,270,598 

32,299,822 

HLA  Region 

NOTCH4 

32,270,598 

32,299,822 

HLA  Region 

NOTCH4 

32,270,598 

32,299,822 

HLA  Region 

NOTCH4 

32,270,598 

32,299,822 

HLA  Region 

NOTCH4 

32,270,598 

32,299,822 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

HLA  Region 

NOTCH4 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,481,210 

rsl 0947261 

5.9E-05 

3.4E-05 

3.8E-05 

1,390 

Pass 

BTNL2 

32,482,600 

rs3763307 

6.3E-09 

2.7E-08 

1 .3  E -08 

8,788 

Pass 

BTNL2 

32,491 ,388 

rs6930933 

3.2E-08 

2.8E-06 

5.2E-07 

448 

Pass 

BTNL2 

32,491 ,836 

rs2001097 

3.8E-08 

2.8E-06 

3.8E-07 

669 

Pass 

BTNL2 

32,492,505 

rs9268541 

7.7E-04 

3.9E-02 

3.2E-01 

572 

BTNL2 

32,493,077 

rs3 135378 

3. IE-08 

3.3E-06 

5.2E-07 

300 

Pass 

BTNL2 

32,493,377 

rs3 135377 

5.0E-18 

2. IE-11 

1.7E-10 

71 

Pass 

BTNL2 

32,493,448 

rs3 135376 

3.9E-08 

3.5E-06 

5.2E-07 

2,282 

Pass 

BTNL2 

32,495,730 

rs2395161 

2.4E-08 

3.3E-06 

4.8E-07 

108 

Pass 

BTNL2 

32,495,838 

rs2395164 

1.4E-09 

6.0E-07 

1.8E-07 

448 

Pass 

BTNL2 

32,496,286 

rs2395167 

1.8E-07 

2.2E-06 

6.0E-07 

266 

Pass 

BTNL2 

32,496,552 

rs22 13580 

2.6E-07 

2.0E-05 

2.7E-06 

938 

Pass 

BTNL2 

32,497,490 

rs9268560 

1.2E-05 

2.1E-04 

3.4E-03 

7,103 

Pass 

BTNL2 

32,504,593 

rs3 135342 

2.9E-06 

6.9E-05 

3.5E-06 

7,520 

Pass 

BTNL2 

32,512,113 

rs5000563 

2.8E-06 

6.7E-05 

1.3E-06 

3,018 

Pass 

BTNL2 

32,515,131 

rs3 129872 

1.8E-06 

5.7E-05 

1.2E-06 

1,374 

Pass 

BTNL2 

32,516,505 

rs9268645 

1. IE-06 

1.6E-05 

1.6E-04 

70 

Pass 

BTNL2 

32,516,575 

rs3 129877 

6.5E-06 

9.4E-05 

1.9E-06 

245 

Pass 

BTNL2 

32,516,820 

rs3 135393 

6. IE-08 

6.0E-06 

4.9E-07 

3,750 

Pass 

BTNL2 

32,520,570 

rsl  051 336 

1.8E-07 

8.2E-06 

1.7E-06 

17,051 

Pass 

BTNL2 

32,537,621 

rs9268853 

1.3E-13 

6. IE-13 

5.9E-11 

115 

Pass 

HLA-DRA 

32,537,736 

rs9268858 

5.0E-13 

7.9E-13 

3.0E-10 

1,389 

Pass 

HLA-DRA 

32,539,125 

rs9268877 

9. IE-19 

8.9E-16 

2.6E-12 

113,205 

Pass 

HLA-DRA 

32,652,330 

rs2027852 

1.4E-33 

0.0E  +  00 

4.6E-14 

29,708 

Fail 

HLA-DRB5 

32,682,038 

rs9270986 

5.8E-16 

2.2E-10 

5.2E-11 

21,023 

Pass 

HLA-DRB1 

32,703,061 

rs3 129768 

1.2E-21 

8.7E-14 

1.7E-13 

7,186 

Pass 

HLA-DRB1 

32,710,247 

rs9272219 

9.3E-05 

3.1E-03 

5.4E-02 

7,158 

Pass 

HLA-DRB1 

32,717,405 

rs9272723 

9.0E-27 

0.0E  +  00 

5.8E-14 

16,845 

Pass 

HLA-DQA1 

32,734,250 

rs9273363 

7.1E-89 

0.0E  +  00 

0.0E  +  00 

28,368 

Pass 

HLA-DQA1 

32,762,618 

rs2856688 

2.9E-17 

9.8E-15 

3.0E-10 

3,439 

Pass 

HLA-DQA1 

32,766,057 

rs7775228 

8.0E-06 

5.6E-06 

1.5E-05 

231 

Pass 

HLA-DQA1 

32,766,288 

rs9469220 

4.9E-19 

4.4E-16 

6.6E-13 

11,690 

Pass 

HLA-DQA1 

32,777,978 

rs2858308 

2.2E-05 

2.9E-05 

2.4E-04 

14,029 

Pass 

HLA-DQB1 

32,792,007 

rs6936863 

7.0E-17 

3.4E-14 

2.4E-11 

1,521 

Pass 

HLA-DQB1 

32,793,528 

rs3916765 

5.2E-30 

0.0E  +  00 

6.7E-14 

3,774 

Pass 

32,797,302 

rs9275765 

4.2E-07 

2.1E-07 

8.5E-06 

179 

Pass 

32,797,481 

rs9275772 

4.1E-07 

2.4E-07 

1.1E-05 

26 

Pass 

32,797,507 

rs9461799 

7.9E-10 

8.8E-11 

2.2E-09 

498 

Pass 

32,798,005 

rs9275793 

3.7E-07 

1.3E-07 

4.7E-06 

21,755 

Pass 

32,819,760 

rs2227127 

9.0E-11 

2.2E-11 

1.7E-10 

322 

Pass 

32,820,082 

rs9276429 

1.2E-10 

1.7E-10 

1.6E-09 

143 

Pass 

32,820,225 

rs9276431 

1.2E-10 

1.9E-10 

1.6E-09 

137 

Pass 

32,820,362 

rs9276432 

1.2E-10 

1.7E-10 

1.6E-09 

883 

Pass 

32,821 ,245 

rs2239800 

2.3E-06 

1.6E-07 

6.4E-08 

600 

Pass 

32,821 ,845 

rs9276435 

7.8E-04 

7.9E-03 

1.1E-03 

916 

32,822,761 

rs9276440 

3.9E-10 

7.5E-10 

3.6E-09 

15,038 

Pass 

32,837,799 

rs7768538 

2.0E-10 

1. IE-10 

7.9E-10 

191 

Pass 

32,837,990 

rs7453920 

1.6E-10 

1.2E-10 

1.4E-09 

1,948 

Pass 

32,839,938 

rs6902723 

2.9E-08 

3.0E-07 

6.3E-08 

250 

Pass 

HLA-DOB 

32,840,188 

rs6903130 

1.0E-07 

7.3E-07 

1.3E-07 

3,934 

Pass 

HLA-DOB 

32,844,122 

rs9296044 

2.8E-09 

1.5E-09 

3.7E-10 

9,383 

Pass 

HLA-DOB 

32,853,505 

rsl 2661 352 

7.0E-04 

2.3E-05 

2.2E-02 

6,087 

Fail 

HLA-DOB 

32,859,592 

rsl  74291 27 

4.0E-04 

3.4E-05 

8.5E-03 

29,940 

Pass 

HLA-DOB 

32,889,532 

rs2070121 

1.9E-04 

7.0E-04 

4.9E-04 

595 

HLA-DOB 

32,890,127 

rsl  7501 267 

8.7E-04 

1.9E-03 

5.2E-03 

433 

HLA-DOB 

32,890,560 

rs2071474 

2.6E-07 

3.1E-09 

8.0E-07 

4,454 

Pass 

HLA-DOB 

32,895,014 

rsl 894407 

3.4E-02 

1.2E-04 

2.9E-03 

139 

HLA-DOB 

32,895,153 

rs9784858 

7.4E-06 

5.3E-04 

8.1E-03 

7,857 

Pass 

HLA-DOB 

32,903,010 

rsl 0484565 

6.0E-05 

5.0E-03 

2.4E-02 

8,808 

Pass 

HLA-DOB 

32,911,818 

rs241429 

1.2E-02 

1.7E-04 

1.4E-03 

574 

HLA-DOB 

32,912,392 

rs241427 

1.8E-13 

6.6E-11 

9.4E-08 

62,578 

Pass 

HLA-DOB 

32,974,970 

rs241403 

3. IE-09 

3.1E-07 

6. IE-07 

3,065 

Pass 

TAPI 

32,978,035 

rs3101942 

4.8E-12 

6.2E-10 

3.5E-08 

33,843 

Pass 

TAPI 

33,011,878 

rsl  51 71 9 

8.4E-10 

6.9E-07 

1.3E-05 

13,957 

Pass 

HLA-DMB 

33,025,835 

rsl 050391 

9.0E-05 

4.0E-04 

4.3E-03 

123 

Pass 

HLA-DMB 

33,025,958 

rsl  153921 6 

1.6E-04 

2.9E-03 

1 .7E-02 

20,219 

HLA-DMB 

33,046,177 

rsl 78401 86 

4.9E-04 

1 .5E-02 

3.7E-02 

27,743 

HLA-DMB 

33,073,920 

rsl  76248 

1.8E-06 

7.1E-08 

4.0E-06 

7,801 

Pass 

HLA-DMA 

33,081 ,721 

rs3 129304 

1.2E-05 

4.5E-06 

1.3E-03 

135 

Pass 

33,081 ,856 

rs3 129303 

1.0E-05 

3.0E-06 

8. IE-04 

4,709 

Pass 

33,086,565 

rs429916 

1.8E-08 

2.8E-07 

3.6E-05 

18,990 

Pass 

33,105,555 

rs3135196 

2.0E-05 

5.7E-06 

2.2E-03 

27,029 

Pass 

HLA-DPA1 

33,132,584 

rs376877 

1.9E-05 

2.0E-05 

3.6E-03 

22,425 

Pass 

HLA-DPA1 

33,155,009 

rsl  431 403 

1.1E-04 

2.1E-06 

3.1E-04 

8,216 

Pass 

HLA-DPA1 

33,163,225 

rs9277542 

6.0E-05 

3.9E-06 

2.2E-04 

533 

Pass 

HLA-DPA1 

33,163,758 

rs3 128963 

3.9E-04 

1.9E-05 

6.5E-04 

166 

Pass 

HLA-DPA1 

33,163,924 

rs3 128966 

5.9E-03 

7.5E-04 

1 .4E-02 

2,928 

HLA-DPA1 

33,166,852 

rs21 79920 

2. IE-06 

3.5E-07 

3.5E-05 

11,875 

Pass 

HLA-DPA1 

33,178,727 

rs31 28921 

2.5E-04 

7.0E-06 

3.4E-04 

573 

Pass 

HLA-DPA1 

33,179,300 

rs3 128923 

1.2E-04 

8.0E-06 

1.2E-04 

4,313 

Pass 

HLA-DPA1 

33,183,613 

rs3117230 

9.3E-07 

1.8E-07 

1.3E-05 

31 

Pass 

HLA-DPA1 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,470,491 

32,482,878 

HLA  Region 

BTNL2 

32,515,625 

32,520,801  Strong 

HLA  Region 

HLA-DRA 

32,515,625 

32,520,801  Strong 

HLA  Region 

HLA-DRA 

32,515,625 

32,520,801  Strong 

HLA  Region 

HLA-DRA 

32,593,129 

32,605,984  Mixed 

Tunisian,  Trinidadian,  Thai  HLA  Region 

HLA-DRB5 

32,654,527 

32,665,559  Mixed 

African,  Japanese,  Caucasian,  Icelandic,  Indian,  Majorcan,  C  HLA  Region 

HLA-DRB1 

32,654,527 

32,665,559  Mixed 

African,  Japanese,  Caucasian,  Icelandic,  Indian,  Majorcan,  C  HLA  Region 

HLA-DRB1 

32,654,527 

32,665,559  Mixed 

African,  Japanese,  Caucasian,  Icelandic,  Indian,  Majorcan,  C  HLA  Region 

HLA-DRB1 

32,713,161 

32,719,407  Strong 

Russian,  African,  Taiwanese,  European  American,  Indian,  lc(  HLA  Region 

HLA-DQA1 

32,713,161 

32,719,407  Strong 

Russian,  African,  Taiwanese,  European  American,  Indian,  lc«  HLA  Region 

HLA-DQA1 

32,713,161 

32,719,407  Strong 

Russian,  African,  Taiwanese,  European  American,  Indian,  lc«  HLA  Region 

HLA-DQA1 

32,713,161 

32,719,407  Strong 

Russian,  African,  Taiwanese,  European  American,  Indian,  lc«  HLA  Region 

HLA-DQA1 

32,713,161 

32,719,407  Strong 

Russian,  African,  Taiwanese,  European  American,  Indian,  lc<  HLA  Region 

HLA-DQA1 

32,735,642 

32,742,419  Strong 

Japanese,  Mixageneted  population  (Caucasian,  african,  ame  HLA  Region 

HLA-DQB1 

32,735,642 

32,742,419  Strong 

Japanese,  Mixageneted  population  (Caucasian,  african,  ame  HLA  Region 

HLA-DQB1 

32, 

888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,888,527 

32, 

,892,762 

32, 

,920,964 

32, 

,929,726 

Weak 

32, 

,920,964 

32, 

,929,726 

Weak 

33, 

,010,393 

33, 

,016,795 

Weak 

33, 

,010,393 

33, 

,016,795 

Weak 

33, 

,010,393 

33, 

,016,795 

Weak 

33, 

,010,393 

33, 

,016,795 

Weak 

33, 

,024,373 

33, 

,028,831 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

33, 

,140,772 

33, 

,149,356 

Mixed 

Finnish,  German,  Japanese  and  Caucasian 
Finnish,  German,  Japanese  and  Caucasian 


HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 
HLA  Region 


HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

HLA-DOB 

TAPI 

TAPI 

HLA-DMB 

HLA-DMB 

HLA-DMB 

HLA-DMB 

HLA-DMA 


HLA-DPA1 

HLA-DPA1 

HLA-DPA1 

HLA-DPA1 

HLA-DPA1 

HLA-DPA1 

HLA-DPA1 

HLA-DPA1 

HLA-DPA1 

HLA-DPA1 


6 

33,183,644 

rs3 128930 

1.5E-06 

2.7E-07 

1.6E-05 

424 

Pass 

6 

33,184,068 

rs872956 

1.2E-06 

5.1E-07 

3.3E-06 

78,566 

Pass 

6 

33,262,634 

rs2855459 

2.1E-02 

5.1E-04 

1 .2E-02 

196,595 

6 

33,459,229 

rs2747476 

4.1E-03 

6.6E-04 

1.8E-01 

186,551 

6 

33,645,780 

rsl 7627049 

1 .7E-02 

2.8E-04 

1 .2E-02 

4,721 

6 

33,650,501 

rs9296095 

2.2E-04 

3.1E-03 

1 .0E-02 

32,486 

6 

33,682,987 

rs444697 

2.5E-04 

3.9E-05 

5.1E-04 

2,373,994 

Pass 

6 

36,056,981 

rsl 7656051 

2.1E-04 

1.1E-03 

8.9E-04 

4,466,734 

6 

40,523,715 

rsl 0947890 

1.5E-02 

1 .2E-02 

4.8E-04 

5,600 

6 

40,529,315 

rs3943783 

4.3E-03 

6.3E-03 

4.5E-04 

3,343 

6 

40,532,658 

rs9471347 

9.2E-03 

8.2E-03 

3.4E-04 

2,155 

6 

40,534,813 

rsl 433728 

1.7E-03 

9.0E-03 

2.3E-04 

18 

6 

40,534,831 

rs2436731 

1.2E-02 

8.7E-03 

4.4E-04 

728 

6 

40,535,559 

rs987065 

8.5E-03 

1 .9E-02 

6.2E-04 

3,423 

6 

40,538,982 

rsl 433725 

1 .3E-02 

1.1E-02 

4.0E-04 

304 

6 

40,539,286 

rsl 021 4422 

1 .7E-02 

1 .2E-02 

7.3E-04 

1,813,322 

6 

42,352,608 

rs47 14586 

6.6E-04 

1.5E-01 

1.3E-01 

688,085 

6 

43,040,693 

rs2274517 

2.6E-04 

1.3E-03 

1.2E-04 

1 ,727,087 

6 

44,767,780 

rs489757 

1.5E-12 

6.8E-07 

6.2E-07 

1,119,930 

Fail 

6 

45,887,710 

rs9395123 

3.0E-04 

7.7E-03 

8.9E-03 

931 ,273 

6 

46,818,983 

rsl  221 1537 

2.7E-04 

1.5E-03 

6.9E-05 

1,814,666 

Pass 

6 

48,633,649 

rs6910476 

8.9E-04 

1 .3E-02 

2.4E-03 

153 

6 

48,633,802 

rs6458618 

4.8E-04 

3.2E-03 

1. IE-03 

28,828 

6 

48,662,630 

rs9369793 

9.7E-04 

9.4E-03 

2.9E-03 

743,379 

6 

49,406,009 

rs542538 

1 .0E-02 

7.8E-04 

7.1E-03 

2,914,138 

6 

52,320,147 

rs537288 

1.3E-04 

3.1E-03 

7.8E-03 

1,617 

6 

52,321 ,764 

rs47 15297 

3.8E-04 

7.6E-03 

1.1E-02 

89,486 

6 

52,411,250 

rs3804506 

2.1E-04 

1.1E-03 

1. IE-03 

486,537 

6 

52,897,787 

rs651733 

7.5E-22 

1.0E-12 

5.9E-13 

19,328,808 

Fail 

6 

72,226,595 

rs9351813 

1 .0E-02 

8.3E-04 

1 .0E-02 

31,884 

6 

72,258,479 

rs9351816 

2.7E-03 

3.0E-04 

2.9E-03 

3,550,686 

6 

75,809,165 

rs4569931 

4.5E-04 

3.5E-03 

5.3E-02 

125,740 

6 

75,934,905 

rsl 778361 5 

8.9E-04 

6.6E-04 

7.5E-03 

45,420 

6 

75,980,325 

rsl 323070 

7.4E-04 

1 .8E-01 

1.2E-01 

115,289 

6 

76,095,614 

rsl 6886475 

6.0E-05 

2.5E-03 

3.5E-04 

417,154 

Pass 

6 

76,512,768 

rs276699 

3.6E-02 

3.3E-02 

7.2E-04 

713,739 

6 

77,226,507 

rsl  490208 

1.3E-01 

2.2E-01 

7.4E-04 

7,278 

6 

77,233,785 

rs7740045 

1.3E-01 

2.3E-01 

8.5E-04 

3,262,617 

6 

80,496,402 

rs9361552 

4.8E-04 

5.5E-03 

1.6E-01 

1,792 

6 

80,498,194 

rs9443691 

5.5E-04 

6. IE-03 

1.6E-01 

5,244,595 

6 

85,742,789 

rsl 322872 

4.8E-03 

6.3E-05 

2.5E-04 

5,016,370 

Pass 

6 

90,759,159 

rs9353711 

2.3E-03 

2.1E-02 

4.3E-04 

7,091,598 

6 

97,850,757 

rsl 7806588 

1. IE-04 

4.5E-04 

1.9E-03 

4,465,017 

6 

102,315,774 

rs9390779 

4.0E-03 

5.8E-04 

1  .IE-01 

1,791,613 

6 

104,107,387 

rs2506743 

5.6E-04 

7.1E-04 

1.1E-02 

12,353,897 

6 

116,461,284 

rsl 11 53594 

2.2E-03 

8.5E-04 

7.9E-03 

1,196,003 

6 

117,657,287 

rs783199 

2.1E-02 

8.0E-04 

4.7E-02 

476,145 

6 

118,133,432 

rs2498587 

2. IE-04 

3.1E-03 

5.7E-03 

1 ,023,784 

6 

119,157,216 

rsl  68861 8 

7.9E-03 

1.3E-04 

5.9E-02 

10,627 

6 

119,167,843 

rs947138 

1 .7E-02 

2.5E-04 

4.3E-02 

4,139 

6 

119,171,982 

rsl  41 761 3 

1.0E-02 

1.1E-04 

1 .3E-02 

9,750,589 

6 

128,922,571 

rs7738739 

1. IE-03 

8.4E-04 

1 .8E-02 

3,933,122 

6 

132,855,693 

rs4895939 

3.5E-04 

2.6E-03 

2.6E-02 

1 ,577,373 

6 

134,433,066 

rs7738638 

1.4E-03 

1.3E-03 

8.3E-04 

7,800 

6 

134,440,866 

rs6906278 

4.5E-04 

1.6E-03 

4.2E-04 

3,473,721 

6 

137,914,587 

rs9689944 

7.2E-05 

2.0E-05 

5.5E-05 

494,666 

Fail 

6 

138,409,253 

rs9376318 

6.5E-05 

7.7E-06 

2.8E-05 

5,008,566 

Pass 

6 

143,417,819 

rs4896618 

9.9E-04 

3.0E-03 

1.8E-04 

3,937,016 

6 

147,354,835 

rs9322085 

2.7E-04 

2.1E-04 

4.1E-04 

38,160 

6 

147,392,995 

rsl  52901 7 

2.6E-03 

9.3E-04 

3.3E-04 

19,763 

6 

147,412,758 

rs6919814 

3.9E-03 

2.3E-04 

1.8E-03 

33 

6 

147,412,791 

rs6920096 

4.3E-03 

2.0E-04 

1.5E-03 

7,261 

6 

147,420,052 

rsl  881 660 

7.4E-03 

4.9E-04 

7.1E-03 

18,386 

6 

147,438,438 

rs6930740 

8.8E-03 

6.1E-04 

5.4E-03 

3,382,452 

6 

150,820,890 

rs4870524 

4.2E-02 

3.4E-04 

5.1E-02 

351 ,884 

6 

151,172,774 

rs782661 

2.8E-02 

2.3E-02 

6.3E-04 

109,995 

6 

151,282,769 

rs9371203 

4.4E-05 

3.9E-05 

3.8E-04 

4,900,120 

Fail 

6 

156,182,889 

rs9371392 

2.7E-02 

1 .3E-02 

8.5E-04 

6,204 

6 

156,189,093 

rs9384368 

1 .7E-01 

6.4E-02 

1.6E-04 

60,588 

6 

156,249,681 

rs4495282 

1.1E-03 

9.0E-04 

6.4E-04 

4,248,580 

6 

160,498,261 

rs9295125 

4.2E-02 

2.3E-01 

8.5E-04 

881 ,394 

6 

161,379,655 

rs6455701 

1.8E-03 

1 .9E-02 

6.6E-04 

90,541 

6 

161,470,196 

rs4709495 

1.1E-02 

5.6E-02 

9.2E-04 

201 

6 

161,470,397 

rs4709496 

4.9E-02 

2.1E-01 

9.1E-04 

368,807 

6 

161,839,204 

rsl  21 91 995 

7.8E-04 

4.0E-03 

2.0E-02 

36,791 

6 

161,875,995 

rsl  21 95005 

3.6E-04 

5.0E-03 

1 .4E-02 

4,829 

6 

161,880,824 

rsl 7651 062 

9. IE-05 

6.5E-04 

6.6E-03 

5,759,491 

Pass 

6 

167,640,315 

rs4709162 

8. IE-04 

1.4E-03 

7.8E-02 

105,546 

6 

167,745,861 

rs4075645 

1.4E-04 

6.3E-03 

5.6E-02 

7 

7,332,182 

rsl  294610 

6.9E-05 

5.9E-04 

5.6E-05 

1 ,087,287 

Poor 

7 

8,419,469 

rsl  007095 

2.5E-03 

3.3E-04 

3.3E-02 

1,188,983 

HLA-DPA1 

HLA-DPA1 


ITPR3 

ITPR3 

MAPK14 


33,140,772 

33,140,772 


33,149,356  Mixed 
33,149,356  Mixed 


33,697,322 

33,697,322 

36,103,551 


33,772,317  Strong 
33,772,317  Strong 
36,186,513 


HLA  Region 
HLA  Region 
HLA  Region 


ITPR3 


4,721 

32,486 


ITPR3 

ITPR3 


5,600 

3,343 

2,155 

18 

728 

3,423 

304 


Intergenic 

PLA2G7 

153 

28,828 


GSTA3 


FILIP1 


Intergenic 


10,627 

4,139 


LOC391040 

PERP 


38,160 

19,763 

33 

7,261 

18,386 


MTHFD1L 


PARK2 


36,791 

4,829 


HLA-DPA1 

HLA-DPA1 


ITPR3 

ITPR3 

MAPK14 

LRFN2 

LRFN2 

LRFN2 

LRFN2 

LRFN2 

LRFN2 

LRFN2 

LRFN2 


Intergenic 

PLA2G7 

Intergenic 

Intergenic 

Intergenic 


GSTA3 


FILIP1 


Intergenic 


Intergenic 

Intergenic 

Intergenic 


LOC391040 

PERP 

LOC7291 78 
LOC729178 
LOC729178 
LOC729178 
LOC729178 
LOC7291 78 


MTHFD1L 


PARK2 

PARK2 

PARK2 


COL28A1 


COL28A1 


7 

9,608,452 

rsl 0253648 

2.7E-03 

1.2E-04 

1.1E-02 

185 

7 

9,608,637 

rsl  02671 11 

3.4E-03 

1.3E-04 

1 .2E-02 

1 ,686,889 

7 

11,295,526 

rs2 189555 

3.9E-04 

1.4E-03 

2.1E-02 

645,450 

7 

11,940,976 

rs4291160 

1.3E-02 

7.0E-04 

2. IE-03 

3,140,475 

7 

15,081,451 

rs6971261 

9.0E-04 

2.5E-04 

6.0E-03 

14,472 

7 

15,095,923 

rs9638721 

5.5E-04 

3.9E-03 

6.1E-03 

3,729 

7 

15,099,652 

rsl 2699683 

3.8E-04 

1.0E-03 

4.0E-03 

344,538 

7 

15,444,190 

rsl  269971 4 

7.4E-04 

1.1E-02 

7.6E-02 

4,077,453 

7 

19,521,643 

rs2024347 

3.5E-05 

2.8E-05 

2.1E-04 

773 

Pass 

7 

19,522,416 

rs2 192486 

1.8E-04 

1.1E-04 

8.3E-04 

17,872 

7 

19,540,288 

rs985307 

7.0E-04 

5.5E-05 

1.3E-04 

303 

Pass 

7 

19,540,591 

rs985306 

6.0E-04 

3.0E-05 

1.4E-04 

611 

Pass 

7 

19,541,202 

rs21 92481 

1.4E-04 

4.6E-05 

4.9E-04 

828 

Pass 

7 

19,542,030 

rs985882 

1.2E-03 

1.3E-04 

2.5E-04 

50 

7 

19,542,080 

rs985881 

6.9E-04 

4.8E-05 

3.9E-05 

18,609 

Pass 

7 

19,560,689 

rs2 192477 

3.6E-05 

3.0E-05 

2.8E-04 

261 

Pass 

7 

19,560,950 

rs4721790 

5.8E-04 

7.7E-04 

2.1E-04 

699 

7 

19,561,649 

CG07003679 

5.4E-04 

1.8E-04 

1.1E-04 

16,338 

7 

19,577,987 

rsl 024881 4 

1.0E-03 

2.9E-04 

1.5E-04 

4,298 

7 

19,582,285 

rs7802880 

9.0E-05 

6.1E-05 

8.8E-04 

4,891 

Pass 

7 

19,587,176 

rsl 7451 868 

8.6E-05 

4.6E-05 

5.3E-04 

1,252 

Pass 

7 

19,588,428 

rsl  121 732 

1.6E-04 

4.9E-05 

2.5E-04 

32,298 

Pass 

7 

19,620,726 

rs4307235 

1.3E-04 

3.3E-04 

8.5E-04 

4,582,554 

7 

24,203,280 

rsl  56293 

4.0E-04 

1 .2E-02 

1.9E-03 

20,951 

7 

24,224,231 

rsl  621 7 

6.9E-04 

3.9E-02 

9.9E-04 

2,719,486 

7 

26,943,717 

rs774257 

4.5E-04 

4.8E-02 

5.4E-02 

38,487 

7 

26,982,204 

rs774250 

4.8E-04 

6.1E-02 

7.4E-02 

101,069 

7 

27,083,273 

rs6953314 

6.9E-04 

9.8E-02 

1  .IE-01 

1,910,664 

7 

28,993,937 

rs3 17768 

5.2E-02 

1.2E-03 

5.6E-04 

12,311,568 

7 

41,305,505 

rs37268 

1 .7E-02 

7.6E-04 

4.6E-02 

11,632,569 

7 

52,938,074 

rsl  02561 47 

1.4E-03 

2.0E-03 

1.0E-03 

185,608 

7 

53,123,682 

rs6967345 

3.9E-04 

7.8E-02 

7.1E-02 

3,229 

7 

53,126,911 

rsl  1238249 

3.6E-04 

8.3E-02 

4.9E-02 

25,310,608 

7 

78,437,519 

rs987435 

2.2E-03 

2.9E-04 

6.2E-04 

3,369,171 

7 

81,806,690 

rsl  1973223 

9.7E-05 

2.7E-03 

3.6E-04 

1 ,356,903 

Pass 

7 

83,163,593 

rs7806361 

4.5E-04 

9.4E-03 

1 .0E-02 

11,197 

7 

83,174,790 

rs505900 

4.0E-05 

4.1E-03 

4.9E-03 

19,988 

Pass 

7 

83,194,778 

rsl 721 6863 

2.5E-04 

4.4E-02 

3.9E-02 

3,060,179 

7 

86,254,957 

rsl 0280549 

8.6E-02 

2.1E-02 

2.1E-04 

824 

7 

86,255,781 

rs2299219 

9.2E-02 

3.1E-02 

5.3E-04 

2,964,497 

7 

89,220,278 

rs4728833 

2.4E-02 

1.2E-01 

5.8E-04 

416,079 

7 

89,636,357 

rs21 58747 

3.5E-02 

5.1E-04 

3.5E-02 

775,193 

7 

90,411,550 

rsl 0250625 

3.8E-04 

4.8E-02 

3.4E-03 

804,386 

7 

91,215,936 

rs803271 

8.3E-04 

1 .9E-02 

3.3E-02 

2,029,706 

7 

93,245,642 

rsl  686851 4 

6.0E-03 

2.7E-03 

9.0E-04 

4,024,085 

7 

97,269,727 

rsl 229556 

9.1E-05 

8.7E-05 

4.6E-04 

1 ,797,644 

Fail 

7 

99,067,371 

rs3735453 

1.8E-05 

5.4E-04 

1.2E-03 

5,808,558 

Fail 

7 

104,875,929 

rs6977560 

3.9E-03 

2.0E-04 

4.7E-04 

271 ,002 

7 

105,146,931 

rs4727625 

2.9E-02 

1.1E-04 

2.6E-02 

326 

7 

105,147,257 

rs538249 

1 .6E-02 

4.5E-04 

2.4E-02 

17,589 

7 

105,164,846 

rs974548 

5.3E-04 

1.1E-04 

7.8E-04 

3,930,697 

7 

109,095,543 

rs820932 

1.5E-02 

2.4E-04 

1.4E-03 

343,388 

7 

109,438,931 

rs2711371 

6.4E-04 

6.2E-03 

5.2E-03 

3,687,480 

7 

113,126,411 

rs2 188422 

8. IE-04 

1.4E-03 

2.1E-02 

3,381,855 

7 

116,508,266 

rsl  27061 41 

8.3E-03 

7.3E-04 

3.8E-02 

5,261,879 

7 

121,770,145 

rs886497 

4.4E-03 

7.6E-04 

5.4E-03 

75,817 

7 

121,845,962 

rs3807874 

7.1E-02 

9.2E-02 

6.2E-04 

342,820 

7 

122,188,782 

rs715205 

3.5E-04 

2.6E-02 

1.3E-03 

5,925,828 

7 

128,114,610 

rs6976879 

5.2E-04 

9.5E-02 

4.5E-03 

1 ,304,707 

7 

129,419,317 

rsl  1768743 

1.9E-05 

2.0E-04 

1.0E-03 

31,415 

Pass 

7 

129,450,732 

rsl  1556924 

9.2E-04 

2.1E-04 

9.2E-04 

1 ,470,592 

7 

130,921,324 

rsl 643286 

9.2E-03 

1.5E-04 

3.5E-04 

2,556,673 

7 

133,477,997 

rs7782175 

3.0E-04 

3.3E-02 

3.4E-02 

12,504 

7 

133,490,501 

rs997316 

3.8E-04 

4.2E-02 

6.3E-02 

7,554,556 

7 

141,045,057 

rsl 0261 374 

4.8E-02 

8.0E-04 

8.9E-02 

138,581 

7 

141,183,638 

rs7806657 

1.5E-04 

2.9E-04 

1.6E-03 

128 

7 

141,183,766 

rs3823668 

1.6E-04 

4.0E-04 

1.2E-03 

2,817 

7 

141,186,583 

rs7785936 

3.0E-04 

5.4E-04 

9. IE-04 

660 

7 

141,187,243 

rsl 0454325 

2.4E-04 

4.3E-04 

1. IE-03 

21,867 

7 

141,209,110 

rsl  655263 

1.0E-04 

1.9E-04 

1.0E-03 

33,015 

7 

141,242,125 

rsl 726842 

2.7E-04 

5.4E-04 

1.8E-03 

11,729 

7 

141,253,854 

rs2471744 

1.2E-04 

3.4E-04 

1. IE-03 

20,554 

7 

141,274,408 

rsl 285935 

1.2E-04 

3.2E-04 

1. IE-03 

2,634,295 

7 

143,908,703 

rs78041 57 

2.9E-03 

7.8E-04 

9. IE-03 

295,516 

7 

144,204,219 

rs6967341 

5.2E-03 

6.6E-04 

4.8E-03 

51,399 

7 

144,255,618 

rsl 404829 

7.2E-03 

4.3E-04 

1.2E-03 

643,523 

7 

144,899,141 

rs6943077 

5.7E-03 

1 .6E-02 

4.2E-04 

27,876 

7 

144,927,017 

rs6960519 

1.4E-03 

4.4E-03 

4.5E-04 

12,258 

7 

144,939,275 

rs850498 

3.1E-03 

6.2E-03 

7.4E-04 

3,542,871 

7 

148,482,146 

rs917124 

3.4E-01 

6.5E-02 

8.6E-04 

3,774,919 

7 

152,257,065 

rsl  860711 

1.1E-02 

7.7E-04 

1.0E-01 

Intergenic 


Intergenic 

Intergenic 

Intergenic 

Intergenic 

Intergenic 


Intergenic 

Intergenic 

Intergenic 


CACNA2D1 

SEMA3E 


ASNS 

ZNF498 


ZC3HC1 


14,472 

tcag.1136 

3,729 

tcag.1136 

tcag.1136 

773 

Intergenic 

17,872 

Intergenic 

303 

Intergenic 

611 

Intergenic 

828 

Intergenic 

50 

Intergenic 

18,609 

Intergenic 

261 

Intergenic 

699 

Intergenic 

16,338 

Intergenic 

4,298 

Intergenic 

4,891 

Intergenic 

1,252 

Intergenic 

32,298 

Intergenic 

Intergenic 

CACNA2D1 

11,197  SEMA3E 

19,988  SEMA3E 

SEMA3E 


ASNS 

ZNF498 

326  ATXN7L4 

17,589  ATXN7L4 

ATXN7L4 


31,415  ZC3HC1 

ZC3HC1 


128 

Olfactory  Receptor 

2,817 

Olfactory  Receptor 

660 

Olfactory  Receptor 

21,867 

Olfactory  Receptor 

33,015 

Olfactory  Receptor 

11,729 

Olfactory  Receptor 

20,554 

Olfactory  Receptor 
Olfactory  Receptor 

27,876 

Intergenic 

12,258 

Intergenic 

Intergenic 

8  2,963,312 

8  4,250,987 

8  5,019,950 

8  5,020,832 

8  5,022,215 

8  5,581,540 

8  8,157,397 

8  8,161,920 

8  8,845,002 

8  9,097,487 

8  9,123,433 

8  9,182,492 

8  9,687,101 

8  10,782,996 

8  16,360,423 

8  16,370,937 

8  16,382,006 

8  16,384,020 

8  16,890,680 

8  19,133,554 

8  20,185,372 

8  24,511,761 

8  25,501,360 

8  25,511,247 

8  27,819,048 

8  29,675,519 

8  31,755,819 

8  31,765,885 

8  33,432,871 

8  36,899,367 

8  36,907,851 

8  36,988,350 

8  49,574,654 

8  53,481,197 

8  59,231,091 

8  62,096,264 

8  69,461 ,971 

8  72,362,997 

8  72,382,875 

8  73,023,814 

8  73,210,926 

8  74,710,831 

8  75,415,185 

8  75,442,257 

8  75,442,495 

8  76,743,329 

8  90,164,239 

8  102,232,276 

8  102,234,459 

8  110,867,787 

8  127,250,926 

8  127,251,212 

8  127,253,740 

8  127,253,983 

8  128,699,703 

8  134,004,474 

8  134,010,808 

8  136,836,421 

8  140,402,258 

9  929,738 

9  1,456,336 

9  1,916,444 

9  6,685,502 

9  11,035,538 

9  14,376,476 

9  14,382,785 

9  14,879,834 

9  17,584,012 

9  19,157,849 

9  21,818,110 

9  24,715,459 

9  28,054,791 

9  28,432,343 

9  31,884,144 

9  31 ,884,764 

9  32,049,777 

9  32,068,175 

9  32,072,226 

9  32,084,193 

9  32,084,283 

9  32,089,879 


rs2045638  8.8E-04 

rsl  7336043  1.1E-02 

rs7839924  1 .8E-02 

rsl  2545609  2.2E-02 

rs7838077  8.4E-03 

rs2527120  1.9E-03 

rs872137  4.4E-02 

rs867657  8.6E-02 

rs7844524  4.2E-02 

rsl  053036  5.0E-03 

rs4841110  5.1E-03 

rs7839174  1.0E-02 

rs7829463  1.1E-01 

rs2898249  2.8E-02 

rsl  464999  2.6E-02 

rsl  526447  3.4E-03 

rs9325791  1 .2E-02 

rsl  526442  4.7E-03 

rs7005198  7.4E-03 

rs954911  2.9E-04 

rs4922151  1.4E-03 

rsl  1783793  9.3E-04 

rs2979734  7.5E-04 

rs2979740  5.0E-03 

rs45 15501  4.4E-04 

rs6558150  1.2E-02 

rsl  76601 63  4.8E-03 

rs2202262  1.1E-02 

rs2732315  1.7E-03 

rsl  01 0669  6.3E-02 

rs6986249  4.9E-02 

rs3015758  3.9E-02 

rs6991400  3.2E-04 

rsl  31 71 86  1.9E-02 

rsl  1774052  8.5E-05 

rsl  6926700  6.3E-04 

rsl  6934533  3.9E-09 

rs4738123  5.0E-03 

rs920966  1 .7E-02 

rs7843557  3.8E-02 

rsl  0504531  9.8E-04 

rs7817465  4.4E-04 

rsl  0504574  6.1E-04 

rs6993852  4.9E-04 

rs4738452  6.2E-04 

rs9298261  6.0E-03 

rsl  681 437  2.1E-03 

rsl  7381 988  5.4E-03 

rsl  1990866  7.8E-03 

rs3133919  5.3E-04 

rsl  6901 179  3.4E-03 

rsl  6901 182  7.0E-03 

CG08023401  2.7E-03 

CG08023402  1.9E-03 

rs4354269  2.2E-02 

rs2687810  1.1E-02 

rs853296  6.4E-03 

rs7837809  9.9E-05 

rs994769  2.7E-03 

rsl  0977650  4.4E-06 

rsl  0961 586  1.6E-03 

rs7045276  3.3E-04 

rsl  094040  2.7E-04 

rsl  01 221 35  2.3E-02 

rs662526  1.5E-05 

rs2208919  2.6E-04 

rsl  6892  2.5E-02 

rs7859530  5.5E-04 

rs2451648  1.0E-03 

rs3900787  2.8E-02 

rs7870157  1  .IE-04 

rs21 23205  2.4E-02 

rsl  0491888  3.1E-04 

rsl  0970584  1 .7E-02 

rs7024321  6.8E-03 

rs439005  5.1  E -03 

rsl  50831 8  1. IE-03 

rsl  0970751  2.9E-03 

rsl  0970761  1.4E-03 

rsl  0970762  1.7E-03 

rsl  081 3733  1.2E-03 


7.6E-03  3.8E-04 

5.7E-04  2.7E-02 

1.5E-02  1  .IE-04 

2.2E-02  2.5E-04 

3.3E-02  3.2E-04 

2.5E-04  1.4E-02 

5.0E-04  3.6E-03 

5.7E-04  2.7E-03 

6.9E-02  3.1E-04 

1.7E-03  3.0E-04 

1.9E-03  1.9E-04 

8.3E-04  8.0E-03 

7.7E-04  1.3E-02 

2.0E-03  8.2E-04 

8.6E-04  2.2E-01 

2.5E-04  3.3E-02 

7.5E-04  6.2E-02 

7.1E-04  2.2E-02 

3.6E-03  6.3E-04 

5.6E-02  1.1E-01 

1.7E-03  1.4E-04 

1.0E-01  1.2E-01 

3.9E-03  2.7E-04 

2.1E-02  4.7E-04 

8.4E-03  1.1E-01 

2.7E-03  5.8E-04 

2.8E-04  1.3E-02 

8.0E-04  2.1E-02 

2.8E-03  2.3E-04 

1.6E-02  3.9E-05 

9.4E-03  5.9E-05 

2.5E-02  8.6E-04 

5.5E-04  2.3E-03 

1.3E-03  8. IE-04 

2.0E-04  1.8E-02 

1.1E-01  5.8E-03 

2.8E-04  4.0E-04 

6.2E-05  8.7E-04 

6.9E-05  1.2E-03 

4.7E-03  7.2E-04 

6.1E-03  4.6E-01 

6.1E-03  3.0E-02 

1.8E-01  1.3E-01 

1.8E-01  1.9E-01 

1.6E-01  1.7E-01 

2.5E-03  7.4E-04 

9.3E-04  1.8E-02 

3.5E-04  8.8E-04 

7.3E-04  8.8E-04 

1.8E-02  8.8E-03 

9.2E-04  3.0E-03 

8. IE-04  1.9E-03 

5.8E-04  2.8E-03 

9.5E-04  5.0E-04 

7.4E-03  6.8E-04 

7.0E-03  8.8E-04 

3.9E-03  4.2E-04 

1  .IE-03  1.9E-02 

9.6E-04  1.8E-02 

6.8E-04  3.0E-03 

3.9E-04  1.0E-02 

7.9E-04  1.3E-03 

8.8E-04  5.2E-03 

1.3E-03  9. IE-04 

5.5E-04  6.7E-03 

8.4E-03  1.6E-02 

2.7E-04  1.3E-03 

1.9E-04  1.2E-02 

9.0E-06  4.0E-03 

2.3E-02  6.1E-04 

7.0E-04  5.1E-03 

8.5E-04  9.0E-02 

2.0E-02  1 .6E-02 

3.5E-02  5.8E-04 

1.0E-02  3.4E-04 

5.4E-03  2.7E-04 

8.2E-03  4.1E-04 

1.2E-02  3.1E-04 

1.1E-02  5.0E-04 

1.2E-02  2.8E-04 

7.2E-03  1.6E-04 


1 ,287,675 

768,963 

882 

1,383 

559,325 

2,575,857 

4,523 

683,082 

252,485 

25,946 

59,059 

504,609 

I , 095,895 
5,577,427 
10,514 

II, 069 
2,014 
506,660 
2,242,874 
1,051,818 
4,326,389 
989,599 
9,887 

2.307.801 
1 ,856,471 
2,080,300 
10,066 

1 ,666,986 
3,466,496 

8,484  Pass 

80,499  Pass 

12,586,304 
3,906,543 
5,749,894 

2,865,173  Pass 

7,365,707 

2,901,026  Fail 

1 9,878  Pass 

640,939  Pass 

187,112 

1 ,499,905 

704,354 

27,072 

238 

1 ,300,834 
13,420,910 
12,068,037 
2,183 
8,633,328 
16,383,139 
286 
2,528 
243 

1 ,445,720 
5,304,771 
6,334 
2,825,613 

3.565.837  Pass 

526,598  Pass 

460,108 

4,769,058 

4,350,036 

3,340,938 

6,309  Pass 

497,049 

2,704,178 

1.573.837 

2,660,261  Poor 

2,897,349 
3,339,332 
377,552 

3.451.801 
620 

165,013 

18,398 

4,051 

11,967 

90 

5,596 

85,763 


FKSG2 

FKSG2 


FAM110B 

C8orf34 

EYA1 

EYA1 


Intergenic 

DMRT1 

NFIB 


Intergenic 


882  LOC648237 

1,383  LOC648237 

LOC648237 


10,514  MRPL49P2 

11,069  MRPL49P2 

2,014  MRPL49P2 

MRPL49P2 


FKSG2 

FKSG2 


FAM110B 


19,878 


C8orf34 

EYA1 

EYA1 


27,072  GDAP1 

238  GDAP1 

GDAP1 


286 

2,528 

243 


Intergenic 

Intergenic 

Intergenic 

Intergenic 


Intergenic 

DMRT1 


NFIB 

NFIB 


Intergenic 


18,398 

Intergenic 

4,051 

Intergenic 

11,967 

Intergenic 

90 

Intergenic 

5,596 

Intergenic 

Intergenic 

9  32,175,642 

9  33,845,138 

9  34,572,883 

9  34,609,621 

9  35,465,391 

9  35,516,070 

9  35,932,406 

9  35,935,975 

9  36,151,433 

9  38,728,489 

9  38,729,117 

9  72,789,645 

9  77,752,446 

9  88,694,862 

9  89,509,234 

9  89,516,915 

9  89,528,325 

9  89,532,495 

9  89,544,048 

9  89,552,906 

9  91 ,744,788 

9  91,757,048 

9  91 ,764,764 

9  91 ,765,480 

9  91,766,284 

9  91,766,779 

9  91,766,807 

9  91 ,767,437 

9  91,769,410 

9  91,770,540 

9  100,296,841 

9  103,684,909 

9  106,289,415 

9  110,100,784 

9  111,264,564 

9  117,733,470 

9  121,999,881 

9  122,010,630 

9  122,520,517 

9  127,782,685 

9  127,788,070 

9  128,309,667 

9  128,318,676 

9  135,124,929 

9  137,384,871 

10  3,172,026 

10  3,173,747 

10  3,178,812 

10  3,181,527 

10  3,196,365 

10  6,132,099 

10  6,348,652 

10  8,995,283 

10  10,289,033 

10  11,125,170 

10  11,185,259 

10  13,703,017 

10  13,707,784 

10  13,719,714 

10  13,746,373 

10  13,746,414 

10  14,571,536 

10  17,523,193 

10  18,619,743 

10  23,476,515 

10  23,595,022 

10  28,699,446 

10  30,092,558 

10  30,150,521 

10  30,164,780 

10  30,741,822 

10  33,407,810 

10  33,451,441 

10  33,463,699 

10  33,908,380 

10  49,968,527 

10  50,578,070 

10  54,553,366 

10  54,555,442 

10  54,571,725 

10  54,572,148 


rsl  0970830  3.3E-03 

rsl  0971 738  2.1E-02 

rs7032335  8.6E-04 

rsl  09721 68  3.6E-04 

rsl  0972486  4.7E-04 

rsl  329841 5  6.7E-04 

rs7863401  2.2E-03 

rs7019283  2.7E-03 

rs7851615  6.8E-04 

rsl  1794651  8.2E-03 

rs7861857  4.2E-02 

rsl  895063  6.2E-03 

rs2377425  1.6E-03 

rsl  0868500  5.6E-04 

rs31 18860  4.8E-02 

rs3 128499  6.6E-02 

rs941 0942  1.7E-02 

rs2274611  1.4E-02 

rs3128514  2.6E-03 

rs3118840  9.0E-03 

rs7864499  2.0E-04 

rs749194  1  .IE-03 

rsl  0993230  5.7E-04 

rsl  0993257  7.0E-04 

rsl  0821 382  8.8E-04 

rsl  0993330  6.7E-04 

rsl  0993331  7.4E-04 

rsl  0761 354  6.3E-04 

rsl  992544  5.8E-04 

rsl  0993476  7.7E-04 

rsl  01 25760  7.4E-04 

rs9299353  7.7E-03 

CG09016407  5.9E-02 

rsl  0816654  5.8E-04 

rs3935785  2.0E-02 

rs6478214  4.0E-04 

rsl  0984868  3.4E-03 

rs10217145  2.7E-03 

rs4617229  1.2E-03 

rsl  0987074  3.3E-02 

rsl  571 570  3.4E-02 

rsl  6929088  2.1E-03 

rs4837098  1.4E-03 

rs81 76707  2.7E-04 

rsl  1789038  8.0E-04 

rs4880595  6.5E-04 

rs7094698  8.2E-04 

rs7898290  7.3E-04 

rs2388557  9.8E-04 

rs4620624  7.8E-04 

rsl  0905669  6.2E-02 

rs4750200  2.0E-02 

rsl  822801  5.6E-02 

rs2759640  2.1E-02 

rsl  291 870  2.5E-03 

rsl  080845  6.7E-04 

rsl  0906425  1.8E-02 

rs7089243  9.4E-03 

rsl  541 01 9  9.2E-03 

rs4750396  2.3E-02 

rs4748050  2.8E-02 

rsl  0508474  7.3E-04 

rsl  1254600  2.6E-04 

rsl  0828424  9.8E-03 

rsl  082841 2  8.9E-03 

rs4074209  3.0E-03 

rsl  249575  8.8E-04 

rsl  01 3610  8.8E-06 

rsl  1007760  1  .IE-04 

rs21 50556  1.6E-04 

rs6481677  1.7E-02 

rs2666275  7.2E-03 

rs2070302  1 .6E-01 

rs2093055  3.7E-03 

rs6481861  6.8E-04 

rs7074818  7.6E-04 

rsl  258323  5.3E-04 

rsl  006677  2.5E-02 

rsl  0762926  2.8E-02 

rs4408237  5.3E-02 

rsl  0824900  5.8E-02 


4.4E-04  4.9E-05 

1.6E-02  2.5E-04 

4.5E-03  2.2E-03 

5.6E-03  6.2E-03 

1 .2E-02  1 .4E-02 

1  .IE-03  1  .IE-03 

3.6E-04  1.2E-03 

4.1E-04  1.6E-03 

2.6E-02  5.7E-04 

4.4E-04  2.1E-02 

7.7E-04  3.8E-02 

4.2E-03  7.3E-04 

2.5E-04  7.2E-04 

6.1E-03  4.3E-02 

6.0E-02  8.2E-04 

5.6E-02  9.2E-05 

1.3E-02  1.0E-04 

3.4E-02  4.3E-04 

3.3E-03  8.8E-05 

6.7E-02  6.5E-04 

5.1E-04  2.6E-03 

9.1E-04  8.1E-03 

6.4E-04  6.2E-03 

6.5E-04  6.2E-03 

8.6E-04  7.0E-03 

6.5E-04  5.3E-03 

9.0E-04  4.6E-03 

6.3E-04  5.1E-03 

6.3E-04  5.2E-03 

6.5E-04  6.2E-03 

3.9E-03  2.0E-03 

7.6E-04  1.0E-01 

4.0E-02  3.2E-04 

2.3E-02  3.6E-03 

3.3E-04  5.1E-04 

1 .2  E -04  2.8E-03 

3.0E-02  4.0E-04 

6.7E-02  7.5E-04 

6.6E-04  6.2E-03 

2.8E-04  6.1E-02 

9.8E-04  7.3E-02 

5.6E-05  4.0E-02 

3.6E-04  3.2E-02 

1.5E-02  6.1E-03 

2.3E-03  2.4E-01 

7.0E-03  1.2E-02 

4.0E-03  2.5E-03 

2.6E-03  8.5E-03 

9.2E-03  5.1E-03 

2.2E-02  3.8E-02 

1.0E-03  3.9E-02 

1.6E-02  9.7E-04 

8.6E-04  5.6E-04 

3.2E-03  6.1E-04 

1.9E-02  3.6E-04 

7.9E-03  6.1E-02 

7.7E-04  7.1E-03 

3.1E-04  4.9E-03 

4.7E-04  6.3E-03 

7.0E-04  2.5E-02 

8.2E-04  2.0E-02 

1.7E-02  7.8E-03 

1.0E-02  9.8E-02 

2.3E-04  1.5E-02 

6.1E-04  2.8E-03 

9.9E-04  2.5E-02 

6.1E-03  1.6E-01 

4.4E-05  2.2E-05 

1.6E-04  6.7E-03 

2.8E-04  7.0E-03 

5.7E-03  5.3E-04 

9.8E-04  7.0E-03 

8.5E-03  3.1E-04 

3.4E-04  3.1E-01 

2.3E-03  1.3E-03 

2.4E-03  7.5E-03 

8.5E-03  7.2E-04 

7.8E-04  2.9E-03 

7.8E-04  2.9E-03 

9. IE-04  1.5E-03 

9.5E-04  4.2E-03 


I , 669,496  Pass 
727,745 

36,738 

855,770 

50,679 

416,336 

3,569 

215,458 

2,577,056 

628 

34,060,528 

4,962,801 

10,942,416 

814,372 

7.681 

II, 410  Pass 

4,170 

11.553 

8,858  Pass 

2,191,882 
12,260 
7,716 
716 
804 
495 
28 
630 
1,973 
1,130 
8,526,301 
3,388,068 
2,604,506 
3,811,369 
1,163,780 
6,468,906 
4,266,411 
10,749 
509,887 
5,262,168 
5,385 
521 ,597 

9,009  Pass 

6,806,253 

2,259,942 

1,721 

5,065 

2,715 

14,838 

2,935,734 

216.553 

2.646.631 

I , 293,750 
836,137 
60,089 
2,517,758 
4,767 

II, 930 
26,659 
41 

825,122 

2,951,657 

1 ,096,550 

4,856,772 

118,507 

5,104,424 

1,393,112 

57,963  Fail 

14,259 

577,042 

2,665,988 

43.631 
12,258 

444.681 
16,060,147 
609,543 
3,975,296 
2,076 
16,283 
423 
17,517 


ABO 


IL2RA 


MAP3K8 


NRP1 


Intergenic 


Intergenic 


7,681 

CTSL 

CTSL 

11,410 

CTSL 

4,170 

CTSL 

11,553 

CTSL 

CTSL 

8,858 

CTSL 

CTSL 

12,260 

IL6RL1 

7,716 

IL6RL1 

716 

IL6RL1 

804 

IL6RL1 

495 

IL6RL1 

28 

IL6RL1 

630 

IL6RL1 

1,973 

IL6RL1 

1,130 

IL6RL1 

IL6RL1 

FAM125B  9,009  FAM125B 

FAM125B 

135,120,384  135,140,451  Strong  ABO 

1,721  PITRM1 

5,065  PITRM1 

2,715  PITRM1 

14,838  PITRM1 

PITRM1 

6,093,512  6,144,278  Strong  IL2RA  IL2RA 


4,767 

PRPF18 

11,930 

PRPF18 

26,659 

PRPF18 

41 

PRPF18 

PRPF18 

CKS1BP2 


CKS1BP2 


30,762,872  30,790,767 


33,506,432  33,663,839 


MAP3K8 

43,631  LOC401640 

12,258  LOC401640 

NRP1 


2,076 

16,283 

423 

17,517 


Intergenic 

Intergenic 

Intergenic 

Intergenic 


10 

54,589,665 

rs23841 20 

3.2E-02 

7.4E-04 

10 

70,933,270 

rsl  227969 

6.3E-04 

4.8E-04 

10 

70,942,140 

rsl 050931 3 

7.7E-04 

1 .4E-02 

10 

70,942,998 

rsl 900463 

8.6E-04 

1.6E-03 

10 

72,030,065 

rsl 0999427 

7.5E-04 

6.3E-03 

10 

73,478,473 

rsl  245574 

5.9E-03 

1.3E-04 

10 

73,528,555 

rsl  6681 56 

3.8E-03 

3.2E-04 

10 

73,601 ,570 

rsl  1000201 

1.2E-04 

1.7E-05 

10 

73,625,658 

rsl  100021 2 

6.2E-04 

8.7E-05 

10 

80,253,095 

rs7098202 

4.0E-04 

1 .7E-02 

10 

80,552,335 

rs704002 

1.4E-05 

2.1E-05 

10 

80,691 ,847 

rs4980054 

8.7E-03 

1.4E-03 

10 

85,444,336 

rsl  931 146 

1 .0E-02 

7.9E-04 

10 

89,472,813 

rsl 0887746 

3.7E-02 

5.8E-02 

10 

90,473,241 

rs390436 

5.0E-02 

7.0E-02 

10 

90,475,052 

rs434401 

6.1E-02 

1.1E-01 

10 

90,478,444 

rs427687 

4.9E-02 

7.3E-02 

10 

100,359,992 

rs526465 

6.4E-02 

7.4E-04 

10 

104,709,086 

rsl 241 3409 

3.5E-02 

6.3E-02 

10 

106,145,688 

rsl  11 92038 

1.7E-04 

7.4E-03 

10 

106,622,111 

rs2451498 

7.3E-03 

9.8E-04 

10 

110,162,502 

rsl  75461 4 

8.7E-04 

6.3E-04 

10 

110,162,689 

rsl 762059 

2.2E-04 

3.1E-04 

10 

110,193,638 

rsl 977486 

1.7E-04 

2.4E-04 

10 

110,199,746 

rs4528245 

6.3E-05 

3.8E-04 

10 

110,253,321 

rsl  831 586 

7.6E-04 

1.0E-03 

10 

110,253,388 

rsl 332274 

2.7E-04 

2.8E-04 

10 

117,947,120 

rs3824842 

4.0E-03 

3.6E-02 

10 

122,648,007 

rs6585674 

7.3E-04 

1 .2E-02 

10 

123,825,083 

rsl  1200381 

1 .5E-02 

1.5E-03 

10 

124,104,422 

rs7905027 

2.6E-04 

3.9E-02 

10 

129,662,632 

rsl  101 5984 

6.5E-03 

2.8E-04 

10 

133,666,820 

rs869845 

2.2E-03 

9.1E-04 

10 

133,667,732 

rsl  1596895 

1.2E-03 

3.9E-04 

10 

133,792,769 

rsl  491 192 

6.9E-03 

9.9E-04 

10 

133,795,686 

rs2814184 

1. IE-03 

6.5E-04 

11 

932,344 

rsl 0751 669 

1.6E-03 

1.5E-03 

11 

936,070 

rsl 0794350 

3.3E-03 

2.1E-03 

11 

936,985 

rsl 2278229 

1.9E-03 

1.6E-03 

11 

940,293 

rs7395402 

2.0E-03 

1.5E-03 

11 

948,566 

rsl 0902249 

2.3E-03 

1.8E-03 

11 

953,980 

rsl  1246357 

2.2E-03 

1.8E-03 

11 

954,271 

rs71 05477 

2.6E-03 

1.9E-03 

11 

982,691 

rsl 0902263 

7.7E-04 

2.3E-03 

11 

1,854,016 

rsl  806911 

2.4E-09 

3. IE-04 

11 

2,781,804 

rsl  631 66 

4.4E-05 

1 .7E-02 

11 

4,364,301 

rs2269328 

3.5E-03 

2.9E-04 

11 

6,267,802 

rs71 04040 

6.8E-02 

8.0E-04 

11 

18,266,276 

rsl 520884 

7.6E-04 

2.9E-02 

11 

18,313,756 

rs41 50579 

8.0E-04 

1 .2E-02 

11 

18,322,286 

rs41 50622 

8.2E-04 

1 .8E-02 

11 

18,340,025 

rs41 50673 

8.0E-04 

1 .4E-02 

11 

19,873,721 

rsl  102531 9 

6. IE-05 

1.9E-03 

11 

20,907,437 

rsl  083341 2 

4.5E-04 

8.4E-02 

11 

26,543,709 

rs293980 

7.9E-03 

1.1E-04 

11 

28,134,118 

rsl 032090 

3.5E-01 

9.3E-02 

11 

28,248,131 

rsl 564745 

3.4E-01 

8.8E-02 

11 

28,248,654 

rsl 032091 

3.5E-01 

9.5E-02 

11 

30,285,751 

rsl 0835649 

1 .3E-02 

8.3E-04 

11 

30,328,191 

rsl 2363432 

1 .3E-02 

5.6E-04 

11 

61,672,399 

rs3890102 

2.7E-02 

5.8E-04 

11 

61,797,095 

rsl 7709552 

1 .2E-02 

5.1E-04 

11 

67,813,346 

rsl  1606508 

2.2E-03 

3.1E-03 

11 

69,506,242 

rsl 0793046 

1 .7E-02 

2.5E-03 

11 

69,506,281 

rsl 0793047 

9.2E-02 

1 .4E-02 

11 

79,493,493 

rsl 2802458 

4.7E-04 

8.7E-03 

11 

83,266,085 

rsl  160051 2 

6.9E-03 

3.5E-03 

11 

83,278,924 

rsl 71 46322 

7.3E-03 

3.9E-03 

11 

83,306,399 

rs790363 

1.0E-03 

8. IE-04 

11 

84,655,656 

rs516620 

6.6E-03 

9.5E-04 

11 

88,070,572 

rs904462 

1.0E-02 

8.2E-04 

11 

95,587,151 

rs525068 

8.6E-04 

6.7E-02 

11 

106,725,669 

rs573449 

9.5E-04 

6.0E-03 

11 

106,762,375 

rs660962 

8.5E-04 

7.0E-03 

11 

106,792,349 

rs520729 

7.1E-04 

8. IE-03 

11 

106,800,778 

rs632910 

9.3E-04 

3.5E-03 

11 

106,820,087 

rs652429 

6.7E-04 

3.6E-03 

11 

106,820,521 

rs654272 

5.7E-04 

2.0E-03 

11 

106,829,486 

rs476028 

8.2E-04 

3.6E-03 

11 

106,846,750 

rs4753806 

4.9E-04 

5.8E-03 

3.7E-03  16,343,605 

1 .0E-02  8,870 

3.0E-02  858 

1.0E-03  1,087,067 

4.2E-03  1,448,408 

3.5E-02  50,082 

4.4E-03  73,015 

1.3E-03  24,088  Pass 

3.4E-03  6,627,437  Pass 

2.9E-02  299,240 

4.0E-05  139,512  Fail 

9.8E-05  4,752,489  Pass 

3.8E-03  4,028,477 

5.8E-04  1,000,428 

9.8E-04  1,811 

7.6E-04  3,392 

8.8E-04  9,881,548 

1 .8E-02  4,349,094 

9.7E-04  1,436,602 

9.7E-02  476,423 

1.4E-03  3,540,391 

2.0E-02  187 

7.2E-03  30,949 

6.6E-03  6,108 

1.1E-02  53,575  Pass 

3.4E-02  67 

5.2E-02  7,693,732 

6.2E-04  4,700,887 

1.0E-02  1,177,076 

9.4E-05  279,339  Pass 

5.1E-03  5,558,210 

2.1E-02  4,004,188 

7.5E-03  912 

3.2E-03  125,037 

2.7E-03  2,917 

7.1E-03 

2.8E-04  3,726 

3.4E-04  915 

2.8E-04  3,308 

2.8E-04  8,273 

3.5E-04  5,414 

3.5E-04  291 

3.5E-04  28,420 

1.0E-03  871,325 

1.4E-03  927,788  Fail 

3.3E-03  1,582,497  Pass 

9.7E-03  1,903,501 

8.6E-03  11,998,474 

1 .6E-01  47,480 

8.7E-02  8,530 

8.7E-02  17,739 

8.6E-02  1 ,533,696 

2.3E-03  1,033,716  Poor 

1 .9E-02  5,636,272 

5.5E-03  1,590,409 

8.5E-04  114,013 

7.2E-04  523 

8.5E-04  2,037,097 

3.2E-02  42,440 

5.4E-02  31 ,344,208 

1.2E-01  124,696 

3.9E-03  6,016,251 

9.4E-04  1,692,896  LRP5  67,836,711 

2.1E-04  39 

8.5E-04  9,987,212 

4.0E-02  3,772,592 

3.9E-04  12,839 

6.5E-04  27,475 

8.9E-04  1,349,257 

2.6E-02  3,414,916 

3.6E-03  7,516,579 

1.9E-02  11,138,518 

3.1E-02  36,706 

4.0E-02  29,974 

3.4E-02  8,429 

5.1E-02  19,309 

3.5E-02  434 

3.8E-03  8,965 

3.7E-02  17,264 

1 .8E-02  1 ,422 


Intergenic 

8,870  TSPAN15 

858  TSPAN15 

TSPAN15 

ASCC1  24,088  ASCC1 

ASCC1  ASCC1 

ZMIZ1  ZMIZ1 

ZMIZ1  ZMIZ1 

1,811  LIPK 

3,392  LIPK 

LIPK 


187  Intergenic 

30,949  Intergenic 

6,108  Intergenic 

Intergenic  Intergenic 


TACC2  TACC2 


3,726  AP2A2 

915  AP2A2 

3,308  AP2A2 

8,273  AP2A2 

5,414  AP2A2 

291  AP2A2 

28,420  AP2A2 

AP2A2 

LSP1  LSP1 

KCNQ1  KCNQ1 

47,480  HPS5 

8,530  HPS5 

17,739  HPS5 

HPS5 

NAV2  NAV2 


Weak  LRP5 


12,839  DLG2 

27,475  DLG2 

DLG2 

36,706  CWF19L2 

29,974  CWF19L2 

8,429  CWF19L2 

19,309  CWF19L2 

434  CWF19L2 

8,965  CWF19L2 

17,264  CWF19L2 

1,422  CWF19L2 


11  106,848,172  rs7924419 

11  106,848,316  rsl  943682 

11  107,924,629  rsl  94021 2 

11  107,937,845  rs2256362 

11  108,080,122  rs71 30825 

11  110,733,308  rs2282637 

11  115,277,018  rsl  121 571 9 

11  116,006,894  rsl  121 6058 

11  116,706,959  rs609663 

11  120,221,094  rs6589847 

11  124,396,305  rsl  121 9869 

11  124,427,978  rsl  121 9878 

11  124,444,444  rs4935903 

11  125,529,743  rs4937107 

11  128,098,703  rs947930 

11  128,779,244  rs4937426 

11  129,814,089  rsl  08941 99 

11  132,616,710  rs4937758 

11  132,652,609  rsl  941 21 7 

11  132,771,541  rsl  0894675 

11  132,977,534  rsl  1606737 

11  133,715,739  rsl  258852 

11  133,722,075  rsl  1223765 

12  1,809,513  rsl  2301 31 2 

12  2,184,580  rsl  2423277 

12  6,775,811  rsl  075836 

12  8,654,230  rsl  561 560 

12  10,659,713  rsl  2306757 

12  10,665,965  rs2900467 

12  12,100,036  rs2909003 

12  17,135,652  rs7974666 

12  17,438,215  rsl  0840701 

12  17,469,182  rs71 33850 

12  17,479,817  rsl  1043458 

12  25,133,862  rs7969931 

12  25,170,626  rsl  7387444 

12  26,146,016  rsl  2367830 

12  26,743,411  rsl  0771 294 

12  31,086,539  rsl  2822390 

12  33,448,491  rs7309981 

12  33,478,148  rs7971226 

12  37,161,778  rsl  1609286 

12  37,162,357  rs7963716 

12  37,175,605  rsl  11 68238 

12  37,225,324  rsl  11 68505 

12  39,606,758  rsl  0784951 

12  39,610,637  rsl  11 79084 

12  39,749,100  rsl  11 79687 

12  40,640,896  rsl  234032 

12  40,693,926  rsl  8821 39 

12  40,757,467  rs6582378 

12  40,784,667  rs865420 

12  46,010,425  rsl  11 83856 

12  46,409,830  rs7484827 

12  51,277,975  rs41 03862 

12  52,602,574  rsl  443504 

12  52,755,293  rs4999382 

12  52,867,331  rs2279402 

12  52,867,750  rs2279399 

12  57,298,926  rs885278 

12  57,303,229  rs21 37461 

12  57,846,706  rsl  230861 9 

12  57,865,339  rsl  11 72932 

12  57,867,116  rsl  11 72934 

12  57,888,169  rs21 73646 

12  61,593,002  rs772564 

12  62,110,882  rsl  11 74970 

12  66,478,388  rs7976772 

12  67,993,433  rsl  456233 

12  75,426,221  rsl  2298661 

12  79,851,408  rs7310000 

12  80,016,846  rsl  0862237 

12  80,076,469  rsl  1835646 

12  80,091,715  rs7964878 

12  80,092,004  rs71 35822 

12  90,745,586  rs7304552 

12  90,798,986  rsl  0507004 

12  92,907,729  rs669493 

12  93,678,633  rs6538534 

12  102,047,685  rsl  81 8702 

12  102,449,370  rsl  0778260 


6.0E-04  1.3E-02  9.6E-02 

5.6E-04  5.7E-03  2.3E-02 

7.1E-03  4.8E-04  8.9E-02 

1.3E-03  8.7E-05  1.0E-02 

2.1E-04  3.0E-04  3.8E-03 

2.5E-04  2.0E-03  5.8E-03 

3.4E-04  4.9E-03  1.4E-02 

1.6E-02  7.5E-04  6.0E-04 

2.1E-04  1.4E-02  1.5E-02 

7.1E-04  6.4E-03  7.2E-02 

9.4E-04  2.3E-01  7.8E-02 

4.4E-04  1.2E-01  1.2E-01 

6.8E-04  1.3E-01  1.3E-01 

8.5E-03  5.5E-04  6.0E-04 

1.4E-04  2.9E-02  1.1E-02 

4.8E-04  5.9E-03  3.3E-03 

2.5E-03  8.2E-04  2.7E-02 

5.2E-03  1.8E-02  5.1E-04 

1.1E-01  7.9E-02  8.8E-04 

4.8E-03  1.0E-03  5.6E-04 

2.2E-03  2.6E-04  1.3E-03 

8.6E-04  2.0E-01  1.0E-01 

2.0E-04  2.4E-03  7.8E-03 

1.0E-02  1.4E-02  7.0E-04 

1.8E-02  8.9E-04  8.4E-03 

6.0E-02  2.7E-04  5.9E-02 

3.3E-03  6.8E-03  1.6E-04 

1.1E-02  4.3E-04  1.3E-02 

1.6E-02  8.6E-04  5.1E-02 

6.3E-08  5.0E-05  4.6E-04 

4.0E-03  6.4E-04  3.3E-03 

2.5E-02  3.5E-02  1.1E-04 

1. IE-02  3.6E-02  1.0E-03 

2.8E-03  1.5E-02  9.4E-04 

5.2E-03  4.1E-04  9.9E-03 

6.2E-03  6.9E-04  1.0E-02 

3.6E-04  4.4E-03  1.9E-02 

1.4E-03  6.4E-04  4.5E-04 

6.1E-05  2.7E-03  5.7E-03 

3.3E-02  1.2E-03  8.0E-04 

6.0E-02  4.6E-03  9.8E-04 

3.6E-02  1.7E-02  9.3E-04 

2.3E-02  1.2E-02  9.0E-04 

1.7E-04  4.9E-03  7.5E-03 

1.8E-02  1.7E-02  7.0E-04 

2.5E-04  4.1E-02  3.6E-02 

1.6E-04  7.4E-02  2.1E-02 

2.8E-04  4.8E-01  1.4E-01 

1.5E-01  2.8E-04  5.7E-02 

2.5E-01  3.9E-04  3.3E-02 

2.4E-01  3.2E-04  5.6E-02 

2.9E-01  7.1E-04  6.0E-02 

3.0E-02  7.1E-04  7.6E-02 

9.5E-04  6.2E-04  7.2E-03 

5.8E-03  7.0E-04  2.6E-03 

3.9E-03  5.4E-04  9.2E-03 

6.3E-04  1.1E-01  2.5E-01 

8.3E-04  1.4E-01  9.5E-02 

8.9E-04  1.3E-01  1.1E-01 

1.9E-02  6.8E-04  2.6E-03 

2.3E-02  8.7E-04  3.2E-03 

1.5E-02  5.4E-03  1.2E-04 

4.9E-02  1.7E-02  2.1E-04 

7.4E-02  3.7E-02  2.5E-04 

1.4E-01  4.2E-02  8.3E-04 

4.0E-02  1.0E-02  6.4E-04 

4.5E-03  8.5E-04  1.3E-03 

4.0E-04  4.0E-03  3.3E-02 

2.6E-03  5.1E-03  4.3E-04 

2.4E-04  9.8E-04  6.9E-03 

2.9E-04  2.6E-03  2.5E-05 

5.8E-02  3.5E-02  8.0E-04 

1  .IE-03  9.2E-03  1.0E-04 

2.4E-03  1.2E-02  9.5E-05 

8.5E-03  2.7E-02  5.0E-04 

9.0E-04  6.9E-02  2.0E-01 

6.8E-03  6.4E-04  2.6E-03 

3.0E-04  6.8E-03  8.7E-04 

5.6E-04  1.3E-02  8.7E-03 

1.5E-03  8.5E-04  2.4E-03 

2.6E-02  1.9E-02  2.7E-04 


144 

1,076,313 

13,216 

142,277  Pass 

2.653.186 
4,543,710 
729,876 
700,065 
3,514,135 
4,175,211 
31,673 
16,466 

1 ,085,299 
2,568,960 

680.541 

1 ,034,845 
2,802,621 

35,899  OPCML 

118,932  OPCML 

205,993  OPCML 

738,205 
6,336 

375,067 

4,591,231 

1,878,419  CD4 

2,005,483 
6,252 
1 ,434,071 

5,035,616  Fail 

302,563 

30,967 

10,635 

7,654,045 

36,764 

975,390 

597,395 

4,343,128 

2,361 ,952  Poor 

29,657 

3,683,630 

579 

13,248 

49.719 
2,381 ,434 
3,879 
138,463 
891 ,796 
53,030 

63.541 
27,200 
5,225,758 
399,405 
4,868,145 
1 ,324,599 

152.719 
112,038 
419 

4,431,176 

4,303 

543,477 

18,633 

1,777 

21,053 

3,704,833 

517,880 

4,367,506 

1,515,045 

7,432,788 

4.425.187 

165,438  Pass 

59,623 

15,246 

289  Pass 

10,653,582 

53,400 

2,108,743 

770,904 

8,369,052 

401 ,685 

1 ,443,454 


131,790,085 

131,790,085 

131,790,085 


6,768,912 


144 

13,216 


CWF19L2 

CWF19L2 

EXPH5 

EXPH5 


31,673  CCDC15 

16,466  CCDC15 

CCDC15 


132,907,613 

132,907,613 

132,907,613 


OPCML 

OPCML 

OPCML 


6,800,237  Strong  Belgian,  Danish  CD4 

BCL2L14  BCL2L14 

30,967  LOC390298 

10,635  LOC390298 

LOC390298 


TSPAN11  TSPAN11 

579  Intergenic 

13,248  Intergenic 

49,719  Intergenic 

Intergenic 


18,633 

1,777 

21,053 


Intergenic 

Intergenic 

Intergenic 

Intergenic 


ACSS3 


15,246 

289 


ACSS3 

ACSS3 

ACSS3 

ACSS3 

ACSS3 


ACSS3 


12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

12 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 

13 


103,892,824 

rsll  112269 

3.4E-05 

3.3E-04 

2.3E-03 

6,756,274 

Pass 

110,649,098 

rs634389 

1.9E-03 

9.6E-04 

2.0E-03 

25,723 

110,674,821 

rs6490294 

8.0E-03 

3.5E-03 

8.5E-04 

146,924 

110,821,745 

rs9971746 

5.7E-04 

4.5E-04 

5.3E-04 

123,387 

110,945,132 

rs7114 

1.5E-03 

9.6E-03 

6.9E-04 

56,893 

111,002,025 

rsl 980364 

9.6E-04 

9.4E-03 

5.3E-04 

190,784 

111,192,809 

rs2285809 

3.6E-04 

1 .3E-02 

3.6E-04 

31,889 

111,224,698 

rsl 859246 

6.4E-04 

2.1E-02 

5.0E-04 

237,462 

111,462,160 

rsl 0850053 

6.7E-04 

9.2E-03 

5.5E-04 

292,948 

111,755,108 

rsl 0850084 

1.3E-04 

3.4E-03 

8.0E-03 

2,325 

111,757,433 

rsl 0492025 

1.4E-04 

1.6E-04 

6.4E-04 

270,778 

112,028,211 

rsl 732771 

1 .5E-02 

9.4E-04 

1.2E-01 

632,949 

112,661,160 

rsl  11 4886 

9.7E-02 

6.7E-05 

4.0E-03 

2,310,075 

Pass 

114,971,235 

rs32 14009 

7.3E-04 

4.0E-03 

7.2E-03 

495,870 

115,467,105 

rsl 0774871 

2.2E-04 

1.8E-03 

2.7E-03 

1 ,903,600 

117,370,705 

rs64901 78 

5.0E-04 

1.1E-03 

8.4E-02 

808,780 

118,179,485 

rs2727688 

5.8E-03 

1.6E-03 

3.9E-04 

1 ,320,359 

119,499,844 

rsl 6950293 

1.  IE-02 

8.1E-04 

5.2E-03 

7,772,996 

127,272,840 

rsl 0847578 

9.9E-04 

1.1E-02 

6.1E-03 

1,135 

127,273,975 

rs9795848 

2.2E-03 

3.0E-02 

5.4E-04 

1,811,788 

129,085,763 

rs628195 

1.2E-03 

3.6E-04 

3.3E-02 

314,915 

129,400,678 

rs3736332 

9.7E-04 

7.7E-04 

5.4E-02 

732,908 

130,133,586 

rs7138155 

6.1E-02 

6.8E-04 

2.1E-02 

23,550,171 

rs9553161 

2.5E-04 

1.2E-01 

4.6E-02 

378 

23,550,549 

rs91 21 51 

5.0E-04 

1.6E-01 

7.4E-02 

1,169,054 

24,719,603 

rs8262 

2.4E-04 

7.3E-03 

1.5E-03 

2,655,037 

27,374,640 

rs95541 96 

6.5E-04 

1 .8E-02 

9.5E-03 

3,340 

27,377,980 

rs93 19399 

3.8E-04 

1 .6E-02 

7.5E-03 

376 

27,378,356 

rs9551419 

7.4E-04 

2.2E-02 

9.5E-03 

1 ,706,388 

29,084,744 

rs590377 

4.6E-04 

6.4E-02 

4.3E-02 

38,942 

29,123,686 

rs550984 

6.3E-04 

7.0E-02 

3.7E-02 

237,026 

29,360,712 

rs9551767 

8.4E-04 

1.7E-01 

1 .0E-02 

1,323 

29,362,035 

rs9578147 

6.8E-04 

1.5E-01 

7.6E-03 

948,093 

30,310,128 

rs7328020 

1.4E-03 

4.7E-04 

1 .7E-02 

5,046,703 

35,356,831 

rs9574777 

1.0E-01 

3.0E-04 

6.3E-02 

3,223,201 

38,580,032 

rs23241 29 

3.2E-04 

2.8E-02 

1.1E-03 

540,612 

39,120,644 

CGI  300481 6 

1 .7E-02 

2.9E-03 

2.6E-04 

4,305,486 

43,426,130 

rs7331414 

3.4E-03 

3.4E-02 

7.8E-04 

2,206 

43,428,336 

rs7322264 

4.3E-03 

2.2E-02 

9.2E-04 

8,016 

43,436,352 

rs9525895 

2.2E-03 

1 .6E-02 

7.8E-04 

1 ,804,878 

45,241 ,230 

rs9590925 

1.6E-02 

2.0E-04 

5.8E-04 

1,515,422 

46,756,652 

rs7333102 

3.0E-03 

9.4E-04 

1.6E-03 

434,390 

47,191,042 

rs6561402 

4.7E-05 

1 .2E-02 

1 .3E-02 

3,929,181 

Pass 

51,120,223 

rsl 7597531 

1.8E-04 

3.5E-03 

5.4E-02 

89,259 

51 ,209,482 

rs9535744 

2.9E-04 

1 .0E-02 

9.8E-02 

14,996 

51,224,478 

rsl 21 46894 

3.7E-04 

9.9E-03 

6.8E-02 

543 

51,225,021 

rs9526793 

3.8E-04 

1.1E-02 

9.8E-02 

14,161 

51,239,182 

rs9526795 

1.0E-04 

6.1E-03 

4.3E-02 

4,456 

51,243,638 

rs2296028 

4.0E-04 

9. IE-03 

6.3E-02 

1,018,249 

52,261,887 

rs402583 

1.1E-01 

4.9E-03 

7.6E-04 

3,572 

52,265,459 

rs886347 

2.0E-02 

6.8E-03 

3.9E-04 

69,940 

52,335,399 

rs489654 

2.2E-03 

1.6E-03 

2.4E-04 

50,760 

52,386,159 

rsl 7052520 

1 .4E-02 

3.4E-02 

2.8E-04 

2,016,241 

54,402,400 

rsl 993777 

3.6E-04 

3.6E-02 

2.8E-02 

10,481,498 

64,883,898 

rs2038825 

1.2E-03 

6.4E-04 

3.6E-02 

1 ,936,375 

66,820,273 

rs2991432 

8.6E-04 

7.0E-03 

1 .3E-02 

1 ,363,320 

68,183,593 

rs287485 

6.8E-04 

1.2E-03 

3.0E-02 

3,519,207 

71,702,800 

rs7328519 

2.4E-04 

2.2E-03 

1 .2E-02 

8,263 

71,711,063 

rs2483837 

5.0E-05 

3.3E-03 

1 .0E-02 

3,688 

Pass 

71,714,751 

rs340523 

7.0E-04 

6.7E-03 

1.0E-01 

170 

71,714,921 

rs340524 

6.6E-05 

2.0E-03 

1 .4E-02 

1,081,145 

Pass 

72,796,066 

rsl  71 91 385 

2.9E-04 

3.1E-04 

3.2E-04 

439,153 

73,235,219 

rs9318213 

1.3E-03 

2.0E-03 

4.3E-04 

1,239,166 

74,474,385 

rsl 7337503 

1 .4E-01 

2.4E-02 

8.5E-04 

5,613,231 

80,087,616 

rsl 772573 

1 .8E-02 

1.8E-04 

9.6E-03 

283,872 

80,371 ,488 

rs6563196 

1.1E-04 

1.6E-03 

5.2E-02 

11,877 

80,383,365 

rs4310745 

9.8E-05 

1.9E-03 

5.3E-02 

2,331,766 

Pass 

82,715,131 

rsl  51 2771 

7.4E-04 

8.6E-03 

1.0E-01 

2,519,561 

85,234,692 

rs9566093 

3.8E-03 

2.0E-02 

6.3E-04 

1,026 

85,235,718 

rs9575947 

3.2E-03 

1 .8E-02 

5.1E-04 

5,870 

85,241 ,588 

rs666540 

4.5E-03 

2.5E-02 

6.1E-04 

953 

85,242,541 

rs652258 

3.7E-03 

2.0E-02 

6.1E-04 

952 

85,243,493 

rs637400 

3.8E-03 

1 .9E-02 

7.6E-04 

375 

85,243,868 

rs6 17544 

3.8E-03 

2.0E-02 

7.6E-04 

1,944 

85,245,812 

rs588777 

3.6E-03 

2.2E-02 

5.9E-04 

17 

85,245,829 

rs606646 

3.8E-03 

2.5E-02 

7.2E-04 

33 

85,245,862 

rs606300 

3.5E-03 

2.0E-02 

5.9E-04 

381 

85,246,243 

rs694502 

4.4E-03 

2.7E-02 

8.7E-04 

1,103 

85,247,346 

rs669668 

2.1E-03 

1 .3E-02 

5.6E-04 

7,320 

85,254,666 

rs673100 

4.0E-03 

1 .5E-02 

8.4E-04 

3,842 

85,258,508 

rs598507 

4.8E-03 

2.6E-02 

6.1E-04 

46,596 

TRAFD1 
Cl  2orf51 

PTPN11 


PDX1 

PDX1 

PDX1 

SLC7A1 


KTR18P20 


KTR18P20 


111,047,764 

111,082,375 

111,340,919 


27,392,177 

27,392,177 

27,392,177 

28,981,551 


111,075,795  TRAFD1 

111,169,738  C12orf51 


111,432,100 


PTPN11 


RBM19 


RBM19 


27,397,410  Mixed 
27,397,410  Mixed 
27,397,410  Mixed 
29,067,721 


3,340  PDX1  PDX1 

376  PDX1  PDX1 

PDX1  PDX1 

SLC7A1 


2,206  LOC400126 

8,016  LOC400126 

LOC400126 


LOC7301 74 


LOC7301 74 


14,996 

543 

14,161 

4,456 


WDFY2 

WDFY2 

WDFY2 

WDFY2 

WDFY2 


Intergenic 

Intergenic 


8,263 

3,688 

170 


Intergenic 

Intergenic 

Intergenic 

Intergenic 


Intergenic 


11,877 


Intergenic 

Intergenic 


1,026 

5,870 

953 

952 

375 

1,944 

17 

33 

381 

1,103 

7,320 

3,842 

46,596 


SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 

SLITRK6 


13 

85,305,104 

rs449704 

1.3E-03 

3.4E-02 

7.5E-04 

1 ,952,945 

13 

87,258,049 

rsl  953341 

8. IE-04 

5.1E-03 

1.3E-03 

5,627,934 

13 

92,885,983 

rs7338402 

6.8E-03 

5.1E-04 

3.4E-02 

12,122 

13 

92,898,105 

rs9524070 

1.  IE-02 

4.3E-04 

3.5E-02 

18,471 

13 

92,916,576 

rs895193 

2.0E-04 

3.2E-05 

1.5E-03 

51,326 

Pass 

13 

92,967,902 

rs95241 1 1 

4.9E-03 

2.0E-04 

5.8E-03 

637,764 

13 

93,605,666 

rsl 328822 

7.4E-04 

1.2E-03 

4.6E-02 

3,616 

13 

93,609,282 

rs9561507 

1.9E-04 

2.5E-04 

9.0E-03 

12,798 

13 

93,622,080 

rsl  6949539 

7.9E-04 

1.2E-03 

1.1E-02 

212,699 

13 

93,834,779 

rsl  71 961 84 

6.1E-03 

1.1E-03 

2.1E-05 

32,358 

Pass 

13 

93,867,137 

rs6650322 

3.4E-01 

1  .IE-01 

7.1E-04 

2,689,635 

13 

96,556,772 

rs2055424 

3.4E-04 

2.9E-02 

2. IE-03 

6,473,442 

13 

103,030,214 

rsl 2865443 

1.2E-01 

1 .2E-02 

9.8E-04 

4,656 

13 

103,034,870 

rsl 2584285 

1. IE-01 

1.1E-02 

9.8E-04 

802 

13 

103,035,672 

rsl 287601 7 

1.3E-01 

1 .2E-02 

9.8E-04 

88,161 

13 

103,123,833 

rs9519133 

1.8E-03 

4.0E-04 

2.0E-04 

1 ,830,289 

13 

104,954,122 

rsl  161 8600 

3.2E-02 

1.2E-04 

1 .8E-02 

2,834,883 

13 

107,789,005 

rs9520845 

1.2E-03 

2.9E-04 

3.3E-02 

1 ,253,397 

13 

109,042,402 

rsl  1069790 

3.3E-05 

2.0E-03 

1.5E-03 

187,225 

Pass 

13 

109,229,627 

rs4771648 

3.3E-03 

3.5E-04 

1.7E-04 

77,719 

13 

109,307,346 

rsl  041 637 

1.6E-03 

6.0E-06 

1.8E-04 

572,812 

Pass 

13 

109,880,158 

rs4551896 

7.2E-02 

2.9E-02 

9.4E-04 

61,694 

13 

109,941,852 

rs402661 

3.2E-04 

1 .2E-02 

2.1E-03 

14 

20,744,204 

rsl 71 02539 

2.1E-02 

9.8E-03 

6.6E-04 

5,719,075 

14 

26,463,279 

rsl 956073 

1.8E-02 

8.1E-04 

4.0E-02 

1,085 

14 

26,464,364 

rsl 2588403 

6.4E-04 

2.4E-03 

1.4E-03 

10,257 

14 

26,474,621 

rsl 956075 

1.0E-02 

7.2E-04 

4.0E-02 

1,220 

14 

26,475,841 

rsl 11 60092 

9.9E-03 

3.3E-04 

1 .8E-02 

8,224 

14 

26,484,065 

rsl  0431 641 

6.1E-03 

8.1E-04 

2.1E-02 

3,626 

14 

26,487,691 

rsl  2881 169 

8.6E-05 

3.9E-04 

2.0E-03 

686 

Pass 

14 

26,488,377 

rs8011593 

8.9E-05 

2.8E-04 

1.4E-03 

5,927 

Pass 

14 

26,494,304 

rs2332880 

2.1E-04 

7.0E-04 

2.1E-03 

7,007 

14 

26,501,311 

rsl  9771 65 

5.6E-03 

6.7E-04 

2.1E-02 

740 

14 

26,502,051 

rs4983208 

1.6E-05 

1.0E-04 

1.1E-03 

282,611 

Pass 

14 

26,784,662 

rsl 0873462 

5.4E-04 

4.9E-03 

7.3E-02 

94,109 

14 

26,878,771 

rs71 45681 

8.8E-04 

3.4E-02 

7.2E-02 

93,495 

14 

26,972,266 

rs2061262 

3.3E-04 

4.8E-03 

2.5E-02 

31,103 

14 

27,003,369 

rsl  71 1271 8 

3.1E-04 

5.4E-04 

3.1E-02 

4,482,662 

14 

31,486,031 

rsl 278930 

8.9E-04 

1.7E-04 

1 .3E-02 

626,405 

14 

32,112,436 

rs8016916 

2.0E-04 

1.0E-03 

5.4E-03 

5,357,409 

14 

37,469,845 

rs801 3747 

6.8E-04 

2.5E-02 

1 .7E-02 

250 

14 

37,470,095 

rs4597229 

6.0E-04 

3.2E-02 

2.9E-02 

16,264,143 

14 

53,734,238 

rsl  01 431 89 

3.8E-03 

6.4E-02 

4.5E-04 

1,352,219 

14 

55,086,457 

rs3742571 

1.6E-02 

5.4E-03 

6.6E-04 

2,011,989 

14 

57,098,446 

rs71 56767 

2.6E-03 

2.3E-04 

6.7E-03 

2,791 ,778 

14 

59,890,224 

rs932606 

8. IE-03 

1.5E-03 

8.1E-04 

15,266 

14 

59,905,490 

rsl 254260 

4.6E-03 

8.4E-04 

1.4E-03 

7,202,158 

14 

67,107,648 

rsl 01 38094 

3.3E-04 

2.8E-03 

1.1E-02 

4,360 

14 

67,112,008 

rs4902490 

6.3E-04 

3.2E-03 

6.3E-03 

9,227,533 

14 

76,339,541 

rs2075773 

8.7E-02 

1 .9E-02 

7.7E-04 

12,763,651 

14 

89,103,192 

rsl  01 46976 

3.5E-04 

4.1E-03 

4.0E-03 

575,768 

14 

89,678,960 

rsl 2432204 

6. IE-03 

5.4E-04 

2.0E-02 

2,873,091 

14 

92,552,051 

rs3783904 

8.0E-04 

6.0E-05 

3.0E-04 

1 ,359,343 

Fail 

14 

93,911,394 

rsl  2431 67 

9.5E-04 

7.0E-03 

5.2E-03 

102,471 

14 

94,013,865 

rsl 7825644 

3.8E-03 

7.1E-04 

8.9E-04 

6,009 

14 

94,019,874 

rs8008729 

3.2E-03 

4.9E-04 

6.5E-04 

702,004 

14 

94,721 ,878 

rs8005908 

4.1E-03 

2.4E-04 

2.2E-03 

1 ,028,449 

14 

95,750,327 

rs4905462 

6.7E-02 

1.0E-01 

5.7E-04 

1 ,840,758 

14 

97,591 ,085 

rs857058 

2.2E-02 

2.5E-04 

2.0E-03 

534,991 

14 

98,126,076 

rs4900406 

1.4E-05 

2.0E-04 

7.5E-04 

94,903 

Fail 

14 

98,220,979 

rsl  2432577 

5.2E-04 

4.0E-04 

1.5E-04 

1 ,062,083 

14 

99,283,062 

rs9324014 

9.4E-03 

1 .0E-02 

1.7E-04 

15 

22,220,838 

rsl  11 61 090 

3.7E-03 

3.2E-02 

9.5E-04 

2,855,810 

15 

25,076,648 

rsl  1636700 

2. IE-06 

2.4E-04 

1.4E-04 

11,234 

Pass 

15 

25,087,882 

rs6606891 

7.0E-06 

4.3E-05 

1.3E-03 

26,953 

Pass 

15 

25,114,835 

CGI  5000874 

8.6E-04 

1 .3E-02 

6.9E-02 

599,102 

15 

25,713,937 

rs3930739 

8.5E-04 

5.1E-03 

1 .9E-02 

415 

15 

25,714,352 

rsl 767401 7 

6.9E-04 

6.6E-03 

1 .0E-02 

7,978 

15 

25,722,330 

rs6497234 

5.7E-05 

9.8E-04 

1.4E-03 

2,847 

Pass 

15 

25,725,177 

rs6497236 

1.5E-04 

1.6E-03 

2.1E-03 

26,380 

15 

25,751,557 

rsl  391 623 

7.9E-05 

2.8E-04 

4.9E-04 

474 

Pass 

15 

25,752,031 

rsl 01 62623 

4.3E-04 

8.1E-03 

1 .2E-02 

1,625 

15 

25,753,656 

rs4778192 

5.1E-04 

9.3E-03 

1 .4E-02 

105 

15 

25,753,761 

rs4778193 

1.3E-04 

3.5E-04 

6.4E-04 

1,444,120 

15 

27,197,881 

rs4779798 

1.4E-04 

5.0E-04 

1.2E-04 

3,008,156 

15 

30,206,037 

rs904952 

8.0E-04 

3.7E-03 

2.5E-03 

673,989 

15 

30,880,026 

rsl  781 6387 

8.6E-04 

2.5E-02 

2.5E-02 

65,365 

15 

30,945,391 

rs8040135 

6.8E-04 

1.2E-03 

1 .5E-02 

8,961 

15 

30,954,352 

rsl  2591 087 

2.8E-05 

7.3E-04 

3.2E-04 

14,521 

Pass 

15 

30,968,873 

rs347937 

3.0E-02 

2.3E-02 

6.0E-04 

124,261 

15 

31,093,134 

rsl  1633931 

7.2E-03 

1.3E-03 

2.6E-04 

375,795 

SLITRK6 


12,122  GPC6 

18,471  GPC6 

GPC6  GPC6 

GPC6 

3,616  GPC6 

12,798  GPC6 

GPC6 

GPC6  32,358  GPC6 

GPC6 

4,656  Intergenic 

802  Intergenic 

Intergenic 

IRS2  IRS2 

IRS2 

IRS2  IRS2 


1,085  UNGP2 

10,257  UNGP2 

1,220  UNGP2 

8,224  UNGP2 

3,626  UNGP2 

UNGP2  686  UNGP2 

UNGP2  5,927  UNGP2 

7,007  UNGP2 

740  UNGP2 

UNGP2  UNGP2 


ITPK1  ITPK1 


Intergenic  Intergenic 


Intergenic  11,234  Intergenic 

Intergenic  26,953  Intergenic 

Intergenic 

415  OCA2 

7,978  OCA2 

OCA2  2,847  OCA2 

26,380  OCA2 

OCA2  474  OCA2 

1 ,625  OCA2 

105  OCA2 

OCA2 

8,961  FMN1 

14,521  FMN1 

FMN1 


FMN1 


15 

31,468,929 

rs461 9367 

2.6E-03 

1.6E-01 

8.2E-04 

4,499,789 

15 

35,968,718 

rs71 83305 

4.8E-03 

2.1E-02 

3.6E-04 

756,591 

15 

36,725,309 

rsl 0438404 

6.8E-04 

1.0E-01 

9.8E-02 

1,425,319 

15 

38,150,628 

rsl 6970291 

3.6E-03 

6.5E-04 

1 .8E-02 

2,159,456 

15 

40,310,084 

rsl  051 8756 

4. IE-05 

1.5E-04 

2.2E-04 

5,241 ,594 

Pass 

15 

45,551 ,678 

rsl  1070587 

9.2E-04 

6.7E-03 

2.0E-04 

39,883 

15 

45,591,561 

rsl 2440759 

5.9E-03 

1 .0E-02 

7.0E-04 

12,105 

15 

45,603,666 

rsl  1070591 

3.8E-03 

7.4E-03 

8.8E-04 

16,203 

15 

45,619,869 

rs8027222 

2.8E-04 

3.4E-03 

1.3E-04 

31,721 

15 

45,651 ,590 

rsl 2592568 

1.6E-04 

1.3E-03 

7.0E-05 

7,983 

Pass 

15 

45,659,573 

rsl 6959700 

2.3E-05 

1.6E-04 

1.2E-05 

1,324 

Pass 

15 

45,660,897 

rs4774498 

2.6E-04 

3.4E-03 

9.1E-05 

18,900 

Pass 

15 

45,679,797 

rsl 2438247 

6.5E-05 

3.6E-04 

3.6E-05 

631 

Pass 

15 

45,680,428 

rsl 6959754 

8.7E-06 

9.1E-05 

6.3E-06 

9,872 

Pass 

15 

45,690,300 

rsl 6959825 

9. IE-05 

7.4E-04 

4.4E-05 

639 

Pass 

15 

45,690,939 

rs8039597 

5.2E-04 

3.4E-03 

1.4E-04 

11,044 

15 

45,701 ,983 

rsl  91 2631 

1 .6E-02 

2.4E-02 

8.7E-04 

15,730 

15 

45,717,713 

rsl 435754 

5.2E-03 

2.2E-03 

4.4E-04 

2,032 

15 

45,719,745 

rs71 63692 

4.5E-03 

1.5E-03 

3.6E-04 

7,503 

15 

45,727,248 

rsl 0851 458 

1 .7E-02 

2.2E-02 

9.0E-04 

2,509,953 

15 

48,237,201 

rs8036133 

2.9E-03 

9.7E-04 

1 .2E-02 

42,831 

15 

48,280,032 

rsl 6963406 

1 .3E-02 

6.6E-04 

5.4E-02 

7,270,724 

15 

55,550,756 

rs4774943 

7.0E-03 

4.0E-04 

1 .7E-02 

943,382 

15 

56,494,138 

rs4 14846 

8.5E-04 

1 .3E-02 

2.2E-02 

1,147,926 

15 

57,642,064 

rs4775177 

2.0E-04 

5.0E-04 

9.7E-03 

1,192,345 

15 

58,834,409 

rsl  871 858 

9.6E-02 

5.6E-04 

2.1E-02 

2,775,945 

15 

61,610,354 

rs982077 

1 .2E-02 

7.9E-04 

2.2E-02 

5,262 

15 

61,615,616 

rs8025240 

1 .5E-02 

7.7E-04 

1 .7E-02 

3,306 

15 

61,618,922 

rs2058913 

1 .9E-02 

7.4E-04 

2.2E-02 

234,644 

15 

61,853,566 

rs9972569 

1.2E-02 

4.5E-04 

1 .5E-02 

922,469 

15 

62,776,035 

rsl  69481 87 

3.8E-04 

3.2E-05 

1.8E-04 

954,912 

Fail 

15 

63,730,947 

rsl 694891 6 

2.4E-05 

1.9E-06 

7.4E-04 

551 ,933 

Fail 

15 

64,282,880 

rsl  1071 870 

4.7E-02 

1.9E-03 

3.9E-04 

20,842,888 

15 

85,125,768 

CGI  501 1200 

1.0E-03 

2.1E-02 

3.0E-01 

4,292,529 

15 

89,418,297 

rsl 7692973 

6.6E-04 

4.7E-03 

1 .6E-02 

2,162,716 

15 

91,581,013 

rs71 65051 

9.4E-04 

3.3E-03 

4.7E-04 

1,501,897 

15 

93,082,910 

rsl  7552225 

8.5E-02 

8.8E-04 

4.4E-01 

2,271 ,528 

15 

95,354,438 

rs234545 

6. IE-03 

1 .8E-02 

5.1E-04 

2,486,119 

15 

97,840,557 

rsl 020256 

8.4E-04 

2.4E-04 

6.2E-02 

16 

7,104,842 

rsl 3331 885 

2.4E-04 

3.7E-03 

2.0E-03 

1,418,464 

16 

8,523,306 

rsl  107731 5 

5.2E-03 

2.7E-04 

2.0E-02 

7,968 

16 

8,531 ,274 

rs6501156 

1.1E-03 

8.6E-04 

4.6E-03 

128,774 

16 

8,660,048 

rs9923816 

2.7E-04 

1.9E-03 

7.1E-02 

4,447,504 

16 

13,107,552 

CGI  6003244 

2.0E-04 

2.7E-02 

3.2E-01 

2,838 

16 

13,110,390 

rsl 7242582 

7.3E-04 

9.0E-02 

4.8E-01 

12,718,209 

16 

25,828,599 

rs4787783 

2.5E-03 

2.0E-02 

9.0E-04 

2,294,595 

16 

28,123,194 

rs7204453 

1 .6E-02 

8.6E-04 

4.6E-03 

352,985 

16 

28,476,179 

rsl  51 320 

1.0E-02 

8.7E-04 

7.7E-02 

28,206 

16 

28,504,385 

rsl  0521 145 

6.9E-03 

6.7E-04 

3.0E-02 

275,082 

16 

28,779,467 

rs7201929 

3.0E-02 

4.4E-04 

8.8E-02 

877,938 

16 

29,657,405 

rs71 93756 

2.1E-04 

1.9E-03 

3.0E-03 

2,890 

16 

29,660,295 

rs7190132 

4.2E-04 

2.9E-03 

3.7E-03 

1 ,963,807 

16 

31,624,102 

rs9938037 

2.5E-04 

1 .3E-02 

7.3E-03 

14,632 

16 

31,638,734 

rs9928006 

5.4E-05 

5.8E-03 

4.3E-03 

114,464 

Pass 

16 

31,753,198 

rs8048000 

3.2E-04 

1 .8E-02 

3.2E-02 

14,702 

16 

31,767,900 

rsl 2447780 

3.0E-05 

9.7E-03 

5.2E-03 

13,500,470 

Pass 

16 

45,268,370 

CGI  6005854 

6.0E-08 

4.5E-06 

3.1E-05 

3,725,654 

Fail 

16 

48,994,024 

rsl  16481 36 

1.2E-13 

8.0E-08 

1. IE-07 

287,564 

Fail 

16 

49,281 ,588 

rs9933594 

9.1E-04 

1 .3E-02 

6.3E-03 

1 ,504,658 

16 

50,786,246 

rs2245948 

7.3E-03 

4.9E-04 

5.1E-03 

426,121 

16 

51,212,367 

rs3 104805 

5.3E-04 

5.2E-03 

1.6E-01 

2,941 ,423 

16 

54,153,790 

rsl  501 999 

4.3E-03 

1.4E-03 

8.4E-04 

4,992,168 

16 

59,145,958 

rs6500133 

2.2E-02 

9.3E-04 

2.3E-02 

5,592,996 

16 

64,738,954 

rs71 95549 

3.9E-04 

8.3E-03 

1.2E-01 

11,773,565 

16 

76,512,519 

rsl  125693 

1.2E-02 

5.0E-04 

9.9E-03 

763,577 

16 

77,276,096 

rsl 6948307 

8.3E-03 

9.6E-04 

5.6E-02 

497,593 

16 

77,773,689 

rs8052087 

6.2E-04 

9.4E-04 

1.9E-03 

943,371 

16 

78,717,060 

rs2016178 

8.9E-04 

1 .3E-02 

1 .6E-02 

44,765 

16 

78,761 ,825 

rsl 6944269 

9.8E-03 

9.2E-02 

6.1E-04 

2,336,482 

16 

81,098,307 

rs964745 

8.1E-02 

1 .2E-02 

2.2E-04 

1,205,169 

16 

82,303,476 

rs9924248 

5.8E-04 

1. IE-01 

1 .3E-02 

3,182,433 

16 

85,485,909 

rs3934493 

1.8E-04 

3.3E-04 

2.7E-04 

894 

16 

85,486,803 

rs4075602 

3.0E-05 

6.6E-05 

3.1E-04 

375,927 

Pass 

16 

85,862,730 

rsl  1649456 

6.5E-03 

1.3E-04 

1 .0E-02 

1,900 

16 

85,864,630 

rsl 3336703 

7.6E-03 

1.6E-04 

6.1E-03 

1 ,290,388 

16 

87,155,018 

rs885065 

1. IE-03 

8.0E-04 

1.3E-01 

10,379 

16 

87,165,397 

rsl  1076678 

1.8E-02 

5.7E-04 

8.9E-02 

1,166,959 

16 

88,332,356 

rsl  230 

6.4E-03 

4.6E-04 

2.3E-02 

26,655 

16 

88,359,011 

rs2159116 

1 .4E-02 

3.9E-05 

5.6E-03 

4,011 

Pass 

16 

88,363,022 

rs4785595 

1 .0E-02 

7.2E-04 

3.2E-02 

3,164 

16 

88,366,186 

rs71 90403 

4.4E-02 

7.3E-05 

3.3E-02 

621 

Pass 

N0D2 


49,288,551 


TMEM87A 


SEMA6D 

SEMA6D 

SEMA6D 

SEMA6D 

SEMA6D 

SEMA6D 


0AZ2 

SLC24A1 


ZNF720 

VN1R3 

VPS35 

BRD7 

49,324,488 


TMEM87A 

39,883 

SEMA6D 

12,105 

SEMA6D 

16,203 

SEMA6D 

31,721 

SEMA6D 

7,983 

SEMA6D 

1,324 

SEMA6D 

18,900 

SEMA6D 

631 

SEMA6D 

9,872 

SEMA6D 

639 

SEMA6D 

11,044 

SEMA6D 

15,730 

SEMA6D 

2,032 

SEMA6D 

7,503 

SEMA6D 

SEMA6D 

5,262  USP3 

3,306  USP3 

USP3 

OAZ2 

SLC24A1 


14,632 

ZNF720 

ZNF720 

14,702 

VN1R3 

VN1R3 

VPS35 

BRD7 

NOD2 

NOD2 

894  Intergenic 

Intergenic  Intergenic 


26,655  FANCA 

FANCA  4,011  FANCA 

3,164  FANCA 

621  FANCA 


FANCA 


16 

16 

16 

16 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

17 

18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 


88,366,807 

rsl 25991 80 

1.1E-02 

6.1E-04 

2.8E-02 

6,744 

88,373,551 

rs3743859 

1 .3E-02 

6.6E-04 

3.1E-02 

119,792 

88,493,343 

rsl  01 53055 

6.8E-02 

9.9E-04 

1.0E-01 

164,128 

88,657,471 

rs4493039 

2.5E-02 

2.2E-04 

1.6E-01 

1 ,278,556 

rs8070640 

2.7E-05 

9.4E-04 

5.7E-05 

4,576 

Pass 

1,283,132 

rs8064892 

5.6E-05 

3.6E-03 

1.8E-04 

25 

Pass 

1,283,157 

rs8073032 

3.0E-04 

1 .6E-02 

4.2E-04 

26,873 

1,310,030 

rs7211083 

3.3E-05 

2.2E-03 

1.0E-04 

1,282,591 

Pass 

2,592,621 

rsl 7834974 

3.8E-02 

1.1E-02 

1.0E-04 

2,774,805 

5,367,426 

rsl  16561 83 

2.0E-03 

1 .3E-02 

7.7E-04 

417,892 

5,785,318 

rsl  1 831 41 

8.3E-04 

1.3E-03 

3.0E-03 

1,117,917 

6,903,235 

rs4796366 

5.4E-03 

4.6E-03 

4.6E-04 

2,368,497 

9,271,732 

rsl 7808941 

1.4E-04 

4.2E-02 

1.3E-04 

5,651 

9,277,383 

rs99 16292 

3.6E-04 

7.0E-02 

1.5E-04 

4,563,229 

13,840,612 

rs9893391 

6.6E-03 

8.8E-04 

1 .7E-02 

45,915 

13,886,527 

rs8077837 

4.9E-03 

5.4E-04 

8.5E-03 

387,184 

14,273,711 

rs6502358 

1.3E-02 

4.8E-04 

8.2E-02 

422,324 

14,696,035 

rsl  1652535 

1.2E-04 

4.4E-04 

3.2E-02 

318,971 

15,015,006 

rs8082487 

9.0E-02 

9.8E-04 

2.8E-02 

8,747,009 

23,762,015 

rs6505081 

4.1E-03 

7.8E-04 

6.2E-05 

4,932,612 

Pass 

28,694,627 

rsl 2449721 

1 .4E-02 

9.6E-04 

8.1E-02 

1 ,248,264 

29,942,891 

rsl  1654085 

2.9E-04 

9.4E-03 

6.8E-03 

3,202,256 

33,145,147 

rs3110648 

6.5E-02 

9.4E-04 

7.9E-02 

81,065 

33,226,212 

rs9900825 

2.7E-03 

1.0E-03 

2.8E-06 

949,510 

Pass 

34,175,722 

rsl  04351 5 

5.2E-04 

1.0E-01 

8.0E-04 

7,382 

34,183,104 

rs2338115 

3.3E-04 

6.2E-02 

6.6E-04 

4,126,667 

38,309,771 

rs2593595 

1.9E-04 

1 .0E-02 

8.0E-04 

1 ,547,447 

39,857,218 

rsl 869495 

7.3E-02 

7.5E-04 

9.5E-03 

4,485,825 

44,343,043 

rs999475 

7.6E-02 

5.0E-04 

8.1E-02 

20,502 

44,363,545 

rs4793993 

3.3E-02 

2.2E-04 

5.6E-02 

1 ,009,789 

45,373,334 

rs271661 

7.5E-04 

4.6E-03 

1.2E-01 

7,720,314 

53,093,648 

rs8065120 

6.7E-03 

1.8E-03 

1.1E-04 

526,368 

53,620,016 

rs9 17606 

1. IE-01 

1 .5E-02 

7.0E-04 

1 ,896,265 

55,516,281 

rs7211063 

1.9E-03 

4.6E-04 

1.6E-03 

7,193,754 

62,710,035 

rs6504495 

1.4E-04 

6.7E-05 

9.9E-03 

3,601,756 

Pass 

66,311,791 

rs3859276 

5.6E-04 

3.5E-03 

5.0E-03 

5,457 

66,317,248 

rsl 6976275 

1.5E-04 

7.9E-04 

1.6E-03 

1,220,156 

67,537,404 

rs21 93054 

9. IE-04 

3.1E-04 

1 .9E-02 

3,732,572 

rs3745050 

6.5E-03 

6.1E-04 

1 .2E-02 

182,768 

3,915,340 

rs935118 

9.0E-04 

1 .6E-02 

1 .3E-02 

621 

3,915,961 

rs4076011 

6.6E-05 

2. IE-03 

1 .9E-02 

157,339 

Pass 

4,073,300 

rs9963455 

2.8E-04 

3.9E-04 

5.0E-03 

261 

4,073,561 

rs9955521 

3.8E-04 

5.5E-04 

6.9E-03 

2,434,273 

6,507,834 

rs7504263 

9.9E-06 

1.6E-04 

1.6E-03 

924,770 

Pass 

7,432,604 

rsl 6951 745 

1.3E-03 

4.2E-04 

6.1E-03 

258 

7,432,862 

rs4798567 

4.5E-04 

1 .2E-02 

1 .9E-02 

5,779 

7,438,641 

rsl 6951 794 

2.2E-04 

4.5E-03 

8.1E-03 

3,362,835 

10,801,476 

rs8095854 

6.6E-02 

1.0E-01 

1.6E-04 

7,102,886 

17,904,362 

rsl 6962762 

2.0E-02 

7.2E-04 

7.2E-03 

3,089,416 

20,993,778 

rs9959583 

2.9E-03 

4.7E-04 

8.7E-03 

5,050 

20,998,828 

rs8085678 

6.3E-03 

6.3E-04 

8.2E-03 

590,495 

21,589,323 

rsl  971 61 3 

3. IE-03 

4.9E-04 

2.1E-03 

2,813,219 

24,402,542 

rsl 6944936 

2.7E-02 

9.5E-04 

1 .2E-02 

276,907 

24,679,449 

rsl  941 195 

3.1E-02 

3.9E-04 

2.1E-03 

4,864,390 

29,543,839 

rs7237757 

6.0E-04 

6.7E-03 

2.5E-02 

6,240,719 

35,784,558 

rs2862294 

2.9E-04 

5.5E-02 

2.6E-03 

24,870 

35,809,428 

rsl 7632270 

3.2E-04 

8.9E-02 

5.2E-03 

3,808 

35,813,236 

rs2568472 

1.4E-04 

5.3E-02 

2.2E-03 

14,801 

35,828,037 

rs394365 

2.8E-04 

9.3E-02 

5.9E-03 

959,659 

36,787,696 

rs7240599 

3.3E-03 

2.2E-04 

2.9E-03 

2,873,517 

39,661,213 

rsl 7677611 

3.0E-02 

8.5E-04 

1 .0E-02 

2,721,636 

42,382,849 

rs41 33965 

2. IE-03 

4.9E-04 

1 .2E-02 

2,453,249 

44,836,098 

rs2044550 

1.8E-04 

2.7E-03 

1.0E-01 

113,449 

44,949,547 

rsl 943000 

9. IE-04 

4.9E-03 

1.0E-01 

55,044 

45,004,591 

rsl 2606493 

6.9E-04 

2.8E-03 

8.9E-02 

13,220 

45,017,811 

rsl 6950465 

4.2E-04 

2.1E-03 

7.5E-02 

88,302 

45,106,113 

rsl 2606466 

3.4E-04 

2.9E-02 

1 .5E-02 

44,246 

45,150,359 

rsl 24551 99 

2.7E-04 

2.7E-02 

1 .5E-02 

87,659 

45,238,018 

rsl 2457899 

7.3E-04 

5.2E-02 

2.0E-02 

5,324 

45,243,342 

rsl 2456478 

6.9E-04 

4.3E-02 

2.0E-02 

206,230 

45,449,572 

rsl 2970803 

3.3E-04 

1.6E-03 

3.8E-03 

383,581 

45,833,153 

rsl 7801 909 

9.0E-04 

5.4E-03 

3.7E-03 

369 

45,833,522 

rsl 7728885 

2.5E-04 

2.3E-03 

7.0E-04 

416,034 

46,249,556 

rsl 780351 4 

4.7E-04 

1 .3E-02 

1 .0E-02 

1,017,362 

47,266,918 

rsl 656747 

3.5E-04 

2.3E-01 

1.6E-01 

6,045,862 

53,312,780 

rs7229018 

6.2E-03 

3.8E-03 

2.8E-04 

960,003 

54,272,783 

rs4940706 

6.3E-03 

3.1E-04 

2.1E-02 

1 ,362,493 

55,635,276 

rs9951126 

1.8E-04 

3.3E-02 

1  .IE-01 

5,991,803 

61,627,079 

rsl 994230 

3.4E-04 

2.1E-03 

4.5E-04 

4,447,184 

66,074,263 

rs8088698 

5.5E-04 

3.9E-02 

1.0E-01 

2,398,554 

68,472,817 

rsl 2962531 

2.7E-03 

9.5E-04 

4.9E-04 

18,204 

6,744 


FANCA 

FANCA 


CRK 

CRK 

CRK 


4,576 

25 

26,873 


CRK 

CRK 

CRK 

CRK 


SLC46A1 


SLC46A1 


HNF1B 


HNF1B 


PSMD12 


PSMD12 


DLGAP1 


621 


DLGAP1 

DLGAP1 


LOC645387 


258 

5,779 


LOC645387 

PTPRM 

PTPRM 

PTPRM 


24,870 

3,808 

14,801 


Intergenic 

Intergenic 

Intergenic 

Intergenic 


18 

68,491,021 

rs9962804 

1 .4E-01 

4.8E-02 

6.5E-04 

1,860,118 

18 

70,351,139 

rsl 2954438 

4.2E-04 

1.6E-04 

1 .0E-02 

1 ,584,586 

18 

71 ,935,725 

rs9947628 

7.6E-04 

5.5E-02 

3.1E-02 

6,057 

18 

71 ,941 ,782 

rs2897285 

9.9E-04 

6.3E-02 

2.2E-02 

4,050,652 

18 

75,992,434 

rsl  1662921 

1.2E-05 

2.3E-03 

6.4E-03 

68 

Pass 

18 

75,992,502 

rsl  1660007 

1.0E-05 

2.2E-03 

8. IE-03 

2,172 

Pass 

18 

75,994,674 

rsl  064059 

1.4E-05 

3.5E-03 

9.6E-03 

21,121 

Pass 

18 

76,015,795 

rs7229329 

7.9E-05 

6.6E-03 

2.5E-02 

1,102 

Pass 

18 

76,016,897 

rs3087701 

5.5E-05 

4.0E-03 

2.2E-02 

109 

Pass 

18 

76,017,006 

rsl  2201 

5.6E-05 

5.4E-03 

3.0E-02 

20,737 

Pass 

18 

76,037,743 

rsl  166481 6 

3.7E-04 

7.9E-03 

5.3E-02 

13,791 

18 

76,051 ,534 

rs8083877 

7.3E-04 

6.2E-03 

1 .3E-02 

19 

5,872,352 

rs274795 

4.9E-04 

8.7E-03 

6.3E-02 

1 ,944,502 

19 

7,816,854 

rs794442 

3.9E-03 

1.9E-04 

3.6E-05 

669,748 

Pass 

19 

8,486,602 

rs2042300 

2.3E-03 

5.2E-03 

2.0E-04 

2,708,693 

19 

11,195,295 

rs48041 55 

4.6E-03 

4.9E-04 

2.5E-03 

3,269,859 

19 

14,465,154 

rs8 109740 

2.4E-04 

3.3E-02 

3.6E-02 

1 ,408,603 

19 

15,873,757 

rs201 8460 

7.9E-05 

1.4E-03 

4.4E-03 

33,893 

Pass 

19 

15,907,650 

rs38 10427 

3.8E-04 

2.0E-02 

3.5E-02 

17,584,685 

19 

33,492,335 

CGI  9002489 

6.4E-04 

2.3E-02 

6.6E-02 

751,115 

19 

34,243,450 

rs7248438 

7.2E-02 

1 .5E-02 

7.2E-04 

1,411,994 

19 

35,655,444 

rs3786803 

2.2E-04 

2.3E-03 

4.7E-02 

409,530 

19 

36,064,974 

rsl 466297 

1.1E-02 

1 .7E-02 

4.9E-04 

2,268,622 

19 

38,333,596 

rsl 0423969 

4.5E-03 

5.5E-04 

3. IE-03 

2,562,934 

19 

40,896,530 

rsl  07068 

2.0E-02 

1.4E-04 

1.9E-03 

7,178,524 

19 

48,075,054 

rsl 0422560 

9.3E-05 

1 .0E-02 

1.6E-03 

5,850,164 

Poor 

19 

53,925,218 

rs281408 

3.1E-04 

6.8E-05 

2.3E-06 

16,292 

Pass 

19 

53,941,510 

rsl 2975781 

7.2E-06 

2.6E-05 

1.8E-07 

2,969,260 

Pass 

19 

56,910,770 

rsl  1667974 

2.2E-05 

3.7E-05 

6.8E-05 

2,633,546 

Fail 

19 

59,544,316 

rs4356595 

5.6E-04 

4.7E-04 

6.4E-02 

20 

2,391,441 

rs21 43862 

2.4E-03 

6.7E-03 

2.8E-04 

8,010 

20 

2,399,451 

rs6049288 

8.6E-03 

3.1E-02 

6.7E-04 

1 ,920,995 

20 

4,320,446 

rs6084726 

9. IE-03 

3.5E-04 

1.2E-01 

1,656,871 

20 

5,977,317 

rs6076920 

3.8E-03 

3.5E-04 

8.4E-04 

5,915,708 

20 

11,893,025 

rsl  99781 2 

1 .3E-02 

3.0E-02 

3.3E-04 

4,618,247 

20 

16,511,272 

rs61 35804 

4.4E-04 

1.7E-03 

8.3E-04 

66,207 

20 

16,577,479 

rs6111223 

7.1E-04 

2.9E-02 

2.7E-01 

2,260,206 

20 

18,837,685 

rs61 36562 

1.0E-03 

1 .3E-02 

1.1E-02 

64,764 

20 

18,902,449 

rs6035176 

3.8E-04 

4.2E-03 

2.4E-02 

4,287,318 

20 

23,189,767 

CG20005607 

1 .4E-01 

6.7E-02 

8.2E-04 

10,180,808 

20 

33,370,575 

rs6060369 

3.1E-01 

7.0E-01 

6.0E-04 

7,047 

20 

33,377,622 

rs6060373 

3.0E-01 

6.1E-01 

3.9E-04 

44,045 

20 

33,421 ,667 

rsl 570004 

2.5E-01 

4.8E-01 

2.8E-04 

13,800 

20 

33,435,467 

rs4911496 

2.4E-01 

4.6E-01 

1.9E-04 

129,768 

20 

33,565,235 

rs2236160 

5.8E-02 

3.8E-01 

7.1E-04 

6,392,944 

20 

39,958,179 

rs2076395 

5.0E-04 

3.1E-03 

1.8E-03 

218 

20 

39,958,397 

rs61 29968 

3.4E-04 

2.3E-03 

8.0E-04 

30,152 

20 

39,988,549 

rs481 0344 

9.5E-04 

2.3E-03 

4.4E-03 

10,590 

20 

39,999,139 

rs481 0345 

7.4E-04 

3.2E-03 

3.8E-03 

5,593,887 

20 

45,593,026 

rsl  21 2595 

1.9E-04 

2.3E-02 

2.4E-04 

1 ,354,944 

20 

46,947,970 

rs6019503 

7.4E-04 

7.3E-02 

2.7E-02 

32 

20 

46,948,002 

rs601 9504 

2.9E-04 

2.4E-02 

8.6E-03 

3,098,768 

20 

50,046,770 

rs6068038 

5.3E-05 

3.9E-03 

4.5E-03 

748,104 

Pass 

20 

50,794,874 

rs241800 

5.4E-03 

9.7E-04 

3.2E-02 

2,729 

20 

50,797,603 

rsl 7806723 

1.9E-04 

4.1E-04 

7.6E-03 

1,310 

20 

50,798,913 

rsl 7734029 

1. IE-04 

3.2E-04 

7.2E-03 

4,281 

20 

50,803,194 

rsl 7806747 

4.4E-04 

5.9E-04 

7.6E-03 

980,105 

20 

51,783,299 

rs6091737 

4.2E-03 

4.9E-04 

9.1E-02 

2,396,245 

20 

54,179,544 

rs6014619 

1.9E-01 

7.4E-04 

1 .3E-02 

403,215 

20 

54,582,759 

rs6069788 

2.3E-04 

2.5E-03 

2.0E-03 

308,279 

20 

54,891 ,038 

rs7261954 

7.1E-04 

3.4E-02 

2.1E-03 

659 

20 

54,891 ,697 

rs6025234 

7.5E-04 

2.8E-02 

1.5E-03 

1,362,133 

20 

56,253,830 

rs3787158 

1 .4E-02 

1 .5E-02 

6.2E-04 

278 

20 

56,254,108 

rs595292 

9.0E-02 

2.5E-02 

7.7E-04 

720,069 

20 

56,974,177 

rs235189 

6.6E-04 

2.1E-02 

3.3E-02 

1,644,211 

20 

58,618,388 

rs6027712 

6.4E-03 

1 .4E-02 

6.8E-04 

228,430 

20 

58,846,818 

rs601 5858 

1.0E-02 

6.4E-04 

9.1E-03 

1,079,641 

20 

59,926,459 

rs6061892 

8.6E-04 

2.7E-04 

5.5E-05 

871,132 

Poor 

20 

60,797,591 

rs6090338 

4.4E-04 

2.4E-04 

8. IE-05 

Pass 

21 

26,266,108 

rs2051174 

1.2E-01 

1.2E-01 

7.5E-04 

2,497,669 

21 

28,763,777 

rs2831839 

2.7E-03 

6.6E-04 

6.5E-02 

189,124 

21 

28,952,901 

rs2831951 

7.2E-03 

6.9E-02 

4.4E-04 

5,312,903 

21 

34,265,804 

rs2834310 

7.0E-04 

8.5E-04 

1.5E-01 

671 ,708 

21 

34,937,512 

rs2834556 

4.4E-02 

1. IE-01 

7.5E-04 

1 ,565,285 

21 

36,502,797 

rs2156412 

1.3E-04 

3.1E-02 

8.6E-02 

2,386,149 

21 

38,888,946 

rsl  71 941 86 

4.8E-02 

1.9E-03 

1.9E-04 

1,406 

21 

38,890,352 

rs2836550 

6.9E-02 

2.7E-03 

5.6E-04 

52 

21 

38,890,404 

rs22 12596 

1.0E-02 

6.2E-04 

4.6E-03 

17,711 

21 

38,908,115 

rs460574 

4.2E-02 

8.1E-04 

2.3E-02 

339,956 

21 

39,248,071 

rs9636950 

6.3E-05 

2.2E-03 

1.1E-02 

15,681 

Pass 

21 

39,263,752 

rs2836792 

3.4E-04 

8.1E-03 

1 .5E-02 

3,458 

ADNP2 

ADNP2 

ADNP2 

ADNP2 

ADNP2 

ADNP2 


EVI5L 


CYP4F11 


PSG1 

IZUM01 

IZUM01 

HAS1 


Intergenic 


CDH4 

SLC04A1 


FLJ45139 


68 

2,172 

21,121 

1,102 

109 

20,737 

13,791 


ADNP2 

ADNP2 

ADNP2 

ADNP2 

ADNP2 

ADNP2 

ADNP2 


EVI5L 


33,893 


CYP4F11 

CYP4F11 


16,292 


PSG1 

IZUM01 

IZUM01 

HAS1 


7,047 

44,045 

13,800 


UQCC 

UQCC 

UQCC 

UQCC 


218  LOC6431 72 

30,152  LOC6431 72 

10,590  LOC6431 72 

LOC6431 72 


2,729 

1,310 

4,281 


Intergenic 

Intergenic 

Intergenic 

Intergenic 

Intergenic 


CDH4 

SLC04A1 


1,406 

ERG 

52 

ERG 

17,711 

ERG 

ERG 

15,681 

FLJ45139 

3,458 

FLJ45139 

21 

39,267,210 

rs2836795 

2.8E-05 

2.7E-03 

4.3E-03 

51 

Pass 

21 

39,267,261 

rs2836796 

3.5E-05 

3.2E-03 

2.7E-03 

93 

Pass 

21 

39,267,354 

rs2836797 

9.9E-05 

6.1E-03 

4.9E-03 

4,177 

Pass 

21 

39,271,531 

rs2836801 

8.1E-05 

5.3E-03 

1 .3E-02 

99,228 

Pass 

21 

39,370,759 

rsl  7231 256 

5.2E-04 

4.5E-03 

5.6E-03 

486,752 

21 

39,857,511 

rs571227 

9.3E-02 

3.0E-01 

9.9E-04 

2,288 

21 

39,859,799 

rs523372 

1.7E-01 

3.6E-01 

8.9E-04 

8,195 

21 

39,867,994 

rsl 734930 

1.6E-01 

9.6E-02 

4.2E-04 

2,638,268 

21 

42,506,262 

rs221 948 

6.9E-04 

3.9E-02 

1.5E-01 

573,947 

21 

43,080,209 

rs883471 

7.8E-04 

1.3E-01 

1.7E-01 

1,917,610 

21 

44,997,819 

rs690260 

3.3E-04 

3.7E-03 

9.5E-04 

1 ,842,295 

21 

46,840,114 

rs2839342 

3.6E-06 

1.9E-05 

3.4E-04 

Fail 

22 

19,542,090 

rsl  78067 

6.6E-04 

1.0E-03 

2.4E-05 

5,096,146 

Fail 

22 

24,638,236 

rsl 476053 

2.5E-08 

5.3E-06 

3.9E-05 

3,186,388 

Fail 

22 

27,824,624 

rs2294242 

7.7E-03 

3.6E-04 

7.3E-02 

144,466 

22 

27,969,090 

rs5763087 

3. IE-06 

1.2E-05 

1.4E-03 

549,155 

Poor 

22 

28,518,245 

rs5752962 

9.7E-04 

2.9E-02 

5.9E-03 

62,067 

22 

28,580,312 

rs5752972 

3.0E-04 

3.9E-02 

8.5E-03 

63,780 

22 

28,644,092 

rs713718 

1.4E-04 

2.6E-02 

4.4E-03 

79,411 

22 

28,723,503 

rs5763688 

2.5E-04 

1 .5E-02 

9.5E-03 

12,480 

22 

28,735,983 

rs41158 

2.0E-05 

1.5E-03 

2.0E-03 

1,381 

Pass 

22 

28,737,364 

rs41159 

4.2E-04 

2.1E-02 

1 .2E-02 

25,039 

22 

28,762,403 

rs41176 

3.5E-04 

2.2E-02 

1.6E-03 

39,851 

22 

28,802,254 

rsl 989870 

2.4E-04 

1 .5E-02 

2.7E-03 

58,837 

22 

28,861,091 

rs4820830 

6.0E-04 

3.5E-02 

7.5E-03 

36,816 

22 

28,897,907 

rs2412976 

4.2E-04 

2.6E-02 

4.5E-03 

5,892,113 

22 

34,790,020 

rs5995203 

5.0E-03 

2.7E-04 

4.7E-05 

1 ,350,633 

Pass 

22 

36,140,653 

rsl  33738 

2.8E-04 

1 .3E-02 

5.2E-02 

1,463 

22 

36,142,116 

rsl  33736 

5.6E-04 

3.7E-02 

6.1E-02 

1 ,687,434 

22 

37,829,550 

rsl  3931 4 

2.0E-03 

9.7E-03 

6.9E-04 

8,068,718 

22 

45,898,268 

rs738662 

4.7E-03 

2.3E-04 

5.8E-02 

11,341 

22 

45,909,609 

rs5767523 

3.2E-03 

4.4E-04 

4.9E-02 

3,526,470 

22 

49,436,079 

rs6009945 

2. IE-04 

3.0E-03 

1 .6E-02 

23 

11,715,814 

rs5935164 

8.6E-04 

2.1E-02 

1.2E-01 

4,763,948 

23 

16,479,762 

rs2428447 

3.4E-02 

5.6E-02 

5.3E-04 

641 ,837 

23 

17,121,599 

rs7049284 

7.5E-04 

1 .4E-02 

5.2E-04 

10,344,080 

23 

27,465,679 

rs221406 

8.2E-04 

2.0E-03 

1 .9E-02 

5,041 ,354 

23 

32,507,033 

rs3928369 

9.0E-04 

9.6E-03 

1 .3E-02 

1,069,617 

23 

33,576,650 

rs2382963 

9.1E-04 

6.0E-03 

2.2E-02 

47,480,190 

23 

81,056,840 

rs4131395 

2.1E-04 

5.3E-04 

1.1E-02 

833,095 

23 

81,889,935 

rs4828519 

5.2E-04 

2.6E-03 

9.6E-04 

1 ,668,266 

23 

83,558,201 

rs4437785 

1.0E-05 

2.3E-05 

8.0E-04 

667,407 

Pass 

23 

84,225,608 

rs5923256 

1.9E-04 

3.4E-03 

5.8E-03 

730,966 

23 

84,956,574 

rs2206006 

1.8E-05 

6.6E-03 

3.1E-02 

409,559 

Pass 

23 

85,366,133 

rs5923512 

6.8E-04 

1 .9E-02 

1 .3E-02 

590,578 

23 

85,956,711 

rsl  1797054 

4.7E-04 

9.3E-03 

1.3E-03 

289,260 

23 

86,245,971 

rs5923792 

1.8E-07 

4.8E-06 

5.1E-04 

394,107 

Pass 

23 

86,640,078 

rs22 13671 

2.5E-06 

2.5E-04 

2.8E-02 

778,740 

Pass 

23 

87,418,818 

rs5969586 

2.7E-04 

4.9E-03 

3.0E-02 

96,500 

23 

87,515,318 

rs6522039 

1.6E-04 

8.4E-04 

5.8E-03 

82,174 

23 

87,597,492 

rsl 2842964 

2.7E-04 

2.0E-03 

2.5E-02 

469,070 

23 

88,066,562 

rs682176 

3.0E-06 

7.2E-04 

8.8E-04 

73,854 

Pass 

23 

88,140,416 

rs3850155 

1.5E-05 

5.8E-04 

5.5E-03 

151,629 

Pass 

23 

88,292,045 

rs5942420 

2.9E-05 

1.8E-03 

1 .3E-02 

3,026,341 

Pass 

23 

91,318,386 

rs4358964 

9.2E-05 

2.2E-02 

8.7E-02 

519,886 

Pass 

23 

91 ,838,272 

rs5942313 

9.9E-06 

3.4E-03 

1.2E-04 

469,497 

Pass 

23 

92,307,769 

rs785754 

5.8E-06 

1.7E-06 

4.8E-05 

166,584 

Pass 

23 

92,474,353 

rs5940144 

2.4E-08 

1.4E-07 

6.5E-05 

369,783 

Pass 

23 

92,844,136 

rs23 12033 

4.3E-04 

3.1E-02 

6.7E-02 

158,907 

23 

93,003,043 

rsl 797020 

4.5E-04 

1.5E-03 

1.2E-01 

618,270 

23 

93,621,313 

rs2782710 

2.2E-04 

1.3E-03 

1.3E-01 

607 

23 

93,621 ,920 

rsl 582247 

2.1E-04 

1.9E-03 

1. IE-01 

640,017 

23 

94,261 ,937 

rs7061674 

6. IE-06 

3.3E-03 

2.4E-03 

59,436 

Pass 

23 

94,321 ,373 

rs661 9774 

9.9E-06 

1.7E-06 

1.3E-05 

16,569 

Pass 

23 

94,337,942 

rsl 0855336 

9. IE-03 

6.2E-04 

5.8E-04 

22,841 

23 

94,360,783 

rs5950249 

6.6E-07 

2.5E-05 

3.3E-06 

410,820 

Pass 

23 

94,771 ,603 

rs21 53931 

1. IE-04 

1.3E-04 

2.2E-03 

1,175,421 

23 

95,947,024 

rs233655 

3.9E-04 

6.2E-03 

7.3E-03 

1 ,360,032 

23 

97,307,056 

rsl  458782 

4.5E-06 

4.3E-03 

2.8E-04 

53,894 

Pass 

23 

97,360,950 

rsl  458780 

5.4E-06 

2.7E-03 

4.4E-04 

2,394,686 

Pass 

23 

99,755,636 

rs736550 

3.8E-06 

2.8E-03 

1 .3E-02 

1 ,465,954 

Pass 

23 

101,221,590 

rs5944840 

7.0E-04 

1 .9E-02 

3.2E-03 

1 ,666,251 

23 

102,887,841 

rs532860 

7.2E-06 

1.5E-05 

3.6E-04 

5,810,170 

Pass 

23 

108,698,011 

rs942602 

3.4E-04 

1 .5E-02 

2.4E-02 

461,501 

23 

109,159,512 

rs2066364 

4.3E-06 

1.5E-06 

9. IE-06 

2,708,220 

Pass 

23 

111,867,732 

rsl 6987072 

9.2E-05 

2.4E-03 

1. IE-03 

194,988 

Pass 

23 

112,062,720 

rs5974328 

5.5E-05 

6.5E-04 

1.1E-05 

123,665 

Pass 

23 

112,186,385 

rs3 125994 

8.6E-07 

2.5E-05 

4.5E-04 

461 ,253 

Pass 

23 

112,647,638 

rs5929302 

1. IE-05 

9.9E-04 

3.6E-05 

458,450 

Pass 

23 

113,106,088 

rs6642665 

1.2E-04 

3.8E-04 

4.8E-04 

2,603,900 

23 

115,709,988 

rs5909877 

2.6E-05 

7.6E-04 

1.9E-03 

41,418 

Pass 
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23 

115,751,406 

rsl 7280008 

7.3E-04 

8.2E-03 

7.9E-03 

221,651 

23 

115,973,057 

rs6646568 

5.9E-06 

1.8E-03 

2.8E-03 

39,012 

Pass 

23 

116,012,069 

rs4462068 

1. IE-06 

9.1E-03 

1.1E-04 

631,197 

Pass 

23 

116,643,266 

rs2878761 

2.0E-04 

2.1E-02 

1 .0E-02 

17,805 

23 

116,661,071 

rs59 12059 

6.9E-04 

1 .2E-02 

2.4E-03 

129,799 

23 

116,790,870 

rs2215785 

1.3E-05 

4.2E-03 

5.2E-03 

538,714 

Pass 

23 

117,329,584 

rsl  71 6765 

1.8E-04 

5.1E-04 

6.1E-02 

98,400 

23 

117,427,984 

rs5956974 

3.4E-04 

1 .2E-02 

2.5E-03 

58,424 

23 

117,486,408 

rs5956063 

7.2E-04 

3.2E-03 

8.3E-04 

130,996 

23 

117,617,404 

rs59 10392 

5.8E-05 

3.1E-05 

1.3E-03 

1,743,170 

Pass 

23 

119,360,574 

rs591 0840 

1.8E-06 

4.3E-04 

2.0E-01 

339,656 

Pass 

23 

119,700,230 

rs4825716 

3.4E-07 

2.9E-06 

9.2E-06 

741 ,224 

Pass 

23 

120,441,454 

rs6608252 

3.3E-11 

2.9E-07 

2.4E-04 

226,422 

Pass 

23 

120,667,876 

rs2507247 

1.0E-04 

2.9E-05 

2.3E-03 

623,489 

Pass 

23 

121,291,365 

rs6648753 

3. IE-05 

2.0E-02 

5.1E-02 

39,452 

Pass 

23 

121,330,817 

rsl 238731 8 

6.4E-06 

1 .9E-02 

2.1E-02 

709,591 

Pass 

23 

122,040,408 

CG23007712 

7.1E-04 

9.9E-04 

5.4E-05 

1,513 

Pass 

23 

122,041,921 

rs924159 

5.2E-04 

1.8E-03 

2.7E-05 

252,298 

Pass 

23 

122,294,219 

rsl  0521 71 8 

1.3E-04 

1 .7E-02 

3.1E-02 

121,033 

23 

122,415,252 

rs6608087 

7.1E-04 

2.9E-03 

1.5E-02 

202,700 

23 

122,617,952 

rs5958281 

7.1E-09 

2.5E-04 

6.7E-05 

660,357 

Pass 

23 

123,278,309 

rsl  292210 

4.8E-04 

6.6E-03 

5.7E-02 

15,898 

23 

123,294,207 

rs5911809 

5. IE-04 

8. IE-03 

6.4E-02 

54,302 

23 

123,348,509 

rs5956652 

8.0E-04 

4.4E-03 

1.3E-04 

24,249 

23 

123,372,758 

rs2239477 

2.8E-04 

1 .6E-02 

1.7E-03 

1 ,644,003 

23 

125,016,761 

rsl 7266022 

9.4E-05 

1.1E-03 

1.4E-03 

117,143 

Pass 

23 

125,133,904 

rs5933374 

1.6E-05 

3.9E-03 

1.9E-02 

187,478 

Pass 

23 

125,321,382 

rs5930842 

1. IE-06 

5.5E-06 

4.7E-05 

395,336 

Pass 

23 

125,716,718 

rs7879025 

1.6E-05 

7.4E-06 

1. IE-03 

1 ,000,041 

Pass 

23 

126,716,759 

rsl 23941 58 

6.9E-04 

2.3E-03 

4.1E-03 

1 ,356,904 

23 

128,073,663 

rs7882525 

5.1E-03 

2.9E-02 

4.6E-04 

3,427 

23 

128,077,090 

rs763301 

5.1E-05 

3.3E-03 

1.7E-05 

115,636 

Pass 

23 

128,192,726 

rs21 03520 

1.7E-05 

2.3E-04 

5.3E-04 

418,522 

Pass 

23 

128,611,248 

rs909657 

3.2E-05 

1.7E-03 

2.1E-04 

1 ,647,752 

Pass 

23 

130,259,000 

rs6529475 

1.2E-04 

2.1E-04 

9.2E-04 

499,424 

23 

130,758,424 

rs5977554 

4.7E-04 

4.1E-03 

9.9E-04 

3,538,940 

23 

134,297,364 

rsl 2689508 

4.2E-03 

9.2E-04 

9.1E-03 

312,668 

23 

134,610,032 

rs5975558 

1.5E-04 

7.8E-03 

2.6E-02 

768,678 

23 

135,378,710 

rsl 203592 

1.5E-05 

8.9E-04 

5.8E-03 

1 ,234,749 

Pass 

23 

136,613,459 

rs2366299 

1.7E-06 

6.4E-04 

1.2E-03 

509,201 

Pass 

23 

137,122,660 

rs677119 

4.8E-05 

5.1E-03 

2.6E-04 

2,747,331 

Pass 

23 

139,869,991 

rsl  1095848 

4.2E-06 

1.9E-04 

1. IE-03 

233,160 

Pass 

23 

140,103,151 

rs5907830 

9.9E-04 

1 .5E-02 

5.4E-02 

594,459 

23 

140,697,610 

rs5908046 

7.5E-05 

1 .4E-02 

4.4E-02 

291 ,098 

Pass 

23 

140,988,708 

rs5907221 

3.4E-06 

9.1E-05 

1.6E-04 

190,936 

Pass 

23 

141,179,644 

rs4824895 

4.6E-04 

1 .8E-02 

3.2E-03 

144,480 

23 

141,324,124 

rs766118 

6.0E-06 

1.9E-01 

5.7E-02 

1 ,086,953 

Pass 

23 

142,411,077 

rsl 6979343 

3.2E-04 

1.6E-03 

3.8E-04 

164,918 

23 

142,575,995 

rsl 2557229 

1.3E-04 

2.8E-02 

7.6E-01 

7,651 

23 

142,583,646 

rsl  72591 81 

1.4E-04 

3.7E-02 

6.9E-01 

1,010,504 

23 

143,594,150 

rsl  1094321 

6.1E-06 

3.3E-04 

2.1E-02 

1,437,140 

Pass 

23 

145,031,290 

rs5920016 

3.0E-04 

2.1E-02 

7.1E-02 

162,626 

23 

145,193,916 

rsl 339482 

4.7E-04 

4.9E-02 

2.8E-02 

412,752 

23 

145,606,668 

rs409263 

4.8E-04 

1.0E-03 

4.1E-03 

555,375 

23 

146,162,043 

rs5905031 

8. IE-05 

1 .2E-02 

4.9E-03 

234,297 

Pass 

23 

146,396,340 

rs524558 

6.3E-04 

1.7E-03 

3.1E-02 

528,831 

23 

146,925,171 

rs5904849 

7.6E-04 

7.7E-04 

2.9E-02 

209,797 

23 

147,134,968 

rs5936450 

5.7E-05 

5.0E-04 

2.3E-04 

158,645 

Pass 

23 

147,293,613 

rs5936352 

5.2E-05 

8.4E-04 

9.0E-03 

194,407 

Pass 

23 

147,488,020 

rs5980375 

3. IE-08 

6.9E-05 

4.7E-04 

10,889 

Pass 

23 

147,498,909 

rs7881705 

3.0E-09 

1.7E-05 

4.6E-05 

388,892 

Pass 

23 

147,887,801 

rs6641482 

4.0E-04 

3.4E-03 

8.7E-03 

224,162 

23 

148,111,963 

rs5980419 

1.4E-05 

5.4E-03 

5.6E-03 

208,770 

Pass 

23 

148,320,733 

rsl  7371 

8.1E-05 

2.2E-03 

4.5E-04 

1 ,266,787 

Pass 

23 

149,587,520 

rsl 0521 889 

1.3E-05 

2.9E-05 

7.2E-05 

1 ,068,950 

Pass 

23 

150,656,470 

rs5970100 

1.8E-04 

1 .2E-02 

1.0E-01 

1,082,139 

23 

151,738,609 

rs6627265 

4.8E-04 

4.3E-05 

1. IE-03 

418,541 

Pass 

23 

152,157,150 

rsl 894344 

7.6E-08 

4.1E-05 

4.4E-05 

228,318 

Pass 

23 

152,385,468 

rs5945399 

2.5E-05 

3.7E-04 

1.9E-03 

92,518 

Pass 

23 

152,477,986 

rs2980017 

9.7E-05 

4.7E-03 

8.0E-02 

92,824 

Poor 

23 

152,570,810 

rs6571284 

6.8E-06 

8.0E-04 

2.3E-02 

633,767 

Pass 

23 

153,204,577 

rs5987245 

7.7E-05 

1.2E-03 

1 .5E-02 

Pass 

24 

735,289 

rs6579619 

5.7E-03 

2.1E-04 

1 .2E-02 

1 ,745,832 

24 

2,481,121 

rs5982552 

3.3E-04 

8.4E-03 

5.0E-03 
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Application  for  Access  to  Genotype  Data 


Name  of  applicant  and  co-applicant(s),  including  affiliations  and  contact  details. 

Please  ensure  that  both  telephone  numbers  and  email  addresses  are  included. _ 

Massimo  Trucco,  MD  (Applicant) 

Children's  Hospital  of  Pittsburgh 
Pittsburgh,  PA  15213  USA 
TEL:  (412)  692-6570 
FAX:  (412)  692-5809 
E-mail:  mnt@pitt.edu 

Bernie  Devlin,  Ph.D.  (Co-Applicant) 

Department  of  Psychiatry 

University  of  Pittsburgh  School  of  Medicine 

Pittsburgh,  PA  15213  USA 

TEL:  (412)  246-6642 

FAX:  (412)  624-8181 

E-mail:  devlinbj@upmc.edu 

Kathryn  Roeder,  Ph.D.  (Co-Applicant) 

Department  of  Statistics 
Carnegie  Mellon  University 
Pittsburgh,  PA  15213 
TEL:  (412)  268-2513 
FAX:  (412)  268-7828 
E-mail:  roeder@stat.cmu.edu 


Title  of  Project 

In  less  than  30  words. 

DETECTION  OF  SUSCEPTIBILITY  LOCI  IN  MULTIFACTORIAL  DISEASE 


Genotype  Data  Requested 

Please  indicate  which  disease  and/or  control  genotypes  you  require. 


Access  to  Data  form,  version  1 ,  Nov06 


Type  1  Diabetes 
Type  2  Diabetes 
Bipolar  Disorder 
Rheumatoid  Arthritis 
Coronary  Heart  Disease 
Hypertension 

Inflammatory  Bowel  Disease 

1958  British  Birth  Cohort  controls 
UK  Blood  Service  controls 

Quantile  Normalised  Data  for  Control  and  Disease  Samples 


Research  Question 

Please  give  a  brief  description  of  the  project  and  its  specific  aims  in  no  more  than  500  words. 
This  should  include  specific  details  of  what  you  plan  to  do  with  the  data  and  include  key 
references. 


Access  to  Data  form,  version  1 ,  Nov06 


As  platforms  for  genome-wide  association  studies  (GWAS)  become  standardized,  numerous 
resources  will  become  available.  Analyses  of  GWAS  are  enhanced  by  the  availability  of  large, 
well  characterized,  populations  such  as  the  WTCCC  cohort.  How  to  exploit  these  datasets 
effectively  is  an  open  question.  As  part  of  our  work  into  the  genetics  of  Type  1  Diabetes 
(T1D)  we  have  developed  a  method  to  match,  by  genetic  ancestry,  controls  to  cases  (1). 
Applying  the  method  to  the  GWAS  obtained  from  our  cohort  of  T1D  (N=394)  and  non-TID 
(N=2,144)  participants  we  have  shown  that  GEnetic  Matching  (GEM)  resulted  in  a  reduced 
rate  of  false  positives  as  well  as  highlighting  of  true  positive  signals  for  association. 

The  goal  is  to  exploit  our  genetic  data  by  combining  it  with  the  WTCCC  resource  while 
developing  approaches  that  improve  power  of  association  studies  to  identify  true  signals.  We 
are  proceeding  with  a  staged  analysis  for  T1D  using  an  independent  cohort  of  cases 
(N=1,300)  and  controls  (N=2,500)  recruited  among  European-Americans  and  residents  from 
Germany.  Development  of  GEM  as  a  method  for  elucidating  association  between  SNP  and 
T1  D  has  led  us  to  anticipate  that  combining  our  cohort  with  the  WTCCC  dataset  will  result  in 
improved  power  to  detect  genes  influencing  disease. 

Substantial  progress  has  been  made  toward  the  completion  of  our  goal.  In  our  recent 
publication  (1)  we  developed  GEM  as  a  means  to  control  for  the  impact  of  ancestry  on 
association  testing,  and  showed  how  using  GEM  allows  multiple  datasets  to  be  combined  for 
discovery  of  genetic  association.  We  contrasted  this  approach  with  other  popular  methods  for 
controlling  the  spurious  effects  of  ancestral  heterogeneity,  namely  Eigenstrat  (2).  Compared 
with  Eigenstrat,  the  GEM  algorithm  improved  data  analysis,  highlighting  true  associations  by 
reducing  the  number  of  confounding  signals  linked  to  false  positives.  In  fact,  genetic  matching 
identified  confirmed  T 1  D  loci  CTLA4,  IL2RA,  and  PTPN22. 

Our  experimental  plan  is  to  use  the  combined  results  from  GWAS  performed  in  Pittsburgh  and 
by  WTCCC  to  optimize  GEM  methodology,  and  apply  the  GEM  algorithm  during  analysis  of 
the  complete  WTCCC  dataset.  We  anticipate  that,  when  the  Pittsburgh  'common'  controls  are 
combined  with  the  WTCCC  dataset,  the  method  will  increase  power  to  detect  true  association. 
In  order  to  accomplish  these  goals  we  request  access  to  the  complete  WTCCC  dataset  of 
common  diseases  and  controls.  Our  initial  focus  will  be  to  enhance  the  power  of  the  GWAS 
performed  in  Pittsburgh  and  by  WTCCC.  We  will  then  direct  our  efforts  towards  exploiting  the 
complete  WTCCC  dataset  delivering  an  analysis  with  increased  power  to  detect  true  signals 
for  association  with  T1D.  This  will  be  our  primary  analysis.  If  our  results  are  promising, 
secondary  analyses  will  target  each  of  the  disease  sets  in  turn,  using  the  other,  appropriate 
data  sets  as  controls.  Finally  it  is  possible  that  some  of  the  WTCCC  common  diseases  have 
etiological  factors  in  common.  For  example  it  is  plausible  that  T1D,  rheumatoid  arthritis,  and 
inflammatory  bowel  disease  inherit  risk  from  a  common  locus.  Joint  analyses  using  GEM 
methods  will  be  useful  to  evaluate  this  possibility. 

1.  Luca  et  al.  On  the  use  of  general  control  samples  for  genome-wide  association  studies: 
genetic  matching  highlights  causal  variants.  Am  J  Hum  Genet  (in  press). 

2.  Price  et  al.  Principal  components  analysis  corrects  for  stratification  in  genome-wide 
association  studies.  Nat  Genet  2006  38:904-909. 
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Introduction 

This  review  presents  a  brief  description  of  the  Selection  of 
Oligonucleotide  Primer  for  PCR  and  Pyrosequencing  (SOP3) 
software.  The  web-based  application  enables  efficient  design  of 
polymerase  chain  reaction  (PCR)  and  pyrosequencing  primers 
that  can  be  used  for  analysis  of  genetic  polymorphisms,  including 
single  nucleotide  polymorphisms  (SNPs).  The  software 
automates  the  process  of  retrieving  genomic  DNA  sequences  and 
locations  of  SNPs  from  the  human  genome  database,  and  takes 
into  consideration  specific  restrictions  necessary  during  the 
design  of  pyrosequencing  assays.1,2  The  application  aids  in  the 
sorting  of  SNPs  for  biologically  interesting  properties  (e.g.,  the 
association  of  a  SNP  with  a  particular  genetic  structural  element 
and  whether  it  codes  for  an  amino  acid  change  in  the  resulting 
protein,  as  well  as  its  heterozygosity  within  a  population)  that,  in 
turn,  adds  to  the  usefulness  of  particular  primer  sets  during 
physical  mapping  studies. 

Pyrosequencing  was  developed  for  the  analysis  of  expressed 
sequence  tags  (EST),  enabling  accurate  sequencing  of  short 
stretches  of  DNA.  This  methodology  has  found  increasing 
acceptance  as  a  method  for  analysis  of  SNPs, 
insertion/deletion  polymorphisms,  and  for  investigating  sites 
of  DNA  methylation.  Sequencing  of  a  maximun  length  of  40 
nucleotides  is  commonly  performed  while  a  maximum  of  150 
nucleotides  sequenced  has  been  reported.3,4  When  applied  to 
genotyping  human  leukocyte  antigen  (HLA),  the  method  is 
capable  of  distinguishing  between  allelic  pairs  commonly 
found  to  be  ambiguous  using  other  methods  [e.g.,  sequence 
specific  oligonucleotide  probe  (SSOP)  and  sequence  base 
typing  (SBT)  genotyping  methods.]4  Pyrosequencing  allows 
increased  resolution  of  allelic  pairs  due  to  its  ability  to 
accurately  resolve  heterozygous  nucleotides  by  enabling  out- 
of-phase  sequencing.5  Performed  by  stepwise  addition  of 
dNTPs,  pyrosequencing  enables  synthesis  of  the  nascent 
nucleotide  chain  that  is  extended  one  nucleotide  residue  per 
dispensation  step.  Detection  of  the  nucleotide  sequence  is 
performed  by  way  of  a  chain  of  enzymatic  reactions  involving 
DNA  polymerase,  apyrase,  ATP  sulfurylase,  as  well  as 
luciferase.3,6  Each  nucleotide  is  dispensed  independently  into 
the  reaction  mix.  The  graphic  display,  depicting  the 
incorporation  of  a  particular  nucleotide,  is  in  the  form  of  a 
nucleotide  dispensation  event  (x-axis)  versus  the  intensity  of 
emitted  light  (y-axis)  and  is  referred  to  as  a  pyrogram  (Figure 


1).  The  pyrosequencing  reaction  is  quantitative,  in  that 
increased  light  intensity  is  produced  upon  incorporation  of 
multiple  nucleotides.  Thus,  DNA  sequence  is  determined  by 
examination  of  light  intensity  emitted  immediately  after 
nucleotide  dispensation  and  is  known  as  sequencing  hy 
synthesis.  Positions  of  polymorphic  sequence  are  identified  due 
to  their  exhibiting  variable  incorporation  patterns  as 
determined  by  the  genotype. 

Software  for  Pyrosequence  Primer  Design 

SOP3  is  a  web-based  software  application 
(http://imgen.ccbb.pitt.edu/sop3)  that  has  been  specifically 
developed  for  use  in  pyrosequence  assay  design.1,2  The 
Webserver  hosts  an  integrated  biological  database  consisting  of 
genetic  variants  as  well  as  the  DNA  sequence  of  the  human 
and  mouse  genomes,  including  validated  human  SNPs 
contained  within  the  HapMap  comparative  genome  project. 
The  software  provides  a  batch  analysis  tool  that  accepts  the 
names  of  as  many  as  100  genetic  loci,  100  SNP  identifiers,  or 
a  100  kb  section  of  genomic  DNA  under  investigation.  The 
application  returns  a  set  of  primer  trios  for  PCR  (2  primers) 
and  pyrosequencing  ( 1  primer)  for  the  corresponding  genetic 
variants  listed  in  dbSNP.  Genomic  DNA  sequences  are  stored 
on  the  web  server  making  them  readily  accessible  to  the 
application.  Detailed  instructions  for  using  SOP3  are 
available  online  using  the  Assay  Design  User  Guide 
(http://imgen.ccbb.pitt.edu/sop3/entry_box.html). 

The  graphical  interface  for  the  SOP3  application  provides  an 
array  of  user-defined  settings  to  allow  customization  of  primer 
trio  design.  Genomic  sequences  from  queried  polymorphisms 
and  their  flanking  sequences  are  returned  to  the  SOP3 
application’s  core  algorithm  and  enable  the  selection  of 
suitable  primer  trios.  Choice  of  PCR  primers  is  based  on 
multiple  criteria  including  the  distance  between  PCR  primer 
annealing  sites  and  the  SNP,  as  well  as  allowing  for  the 
selection  of  optimal  genomic  flanking  sequence  for  annealing 
of  sequencing  primer.  Additional  criteria  for  maximizing  PCR 
amplification  yield  are  included.  For  example,  the  search 
identifies  suitable  primers  at  multiple  annealing  sequences  on 
either  side  of  the  SNP.  The  list  of  candidate  PCR  primers  is 
evaluated  for  their  ability  to  form  a  competing  secondary 
structure  that  may  interfere  with  primer  annealing;  the 
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Figure  1 

Results  of  the  query  for  SNP  rs4081559  contained  within  intron  2  of  HLA-B  as  obtained  from  the  SOP3  application.  Upper  Panel.  The  output  summarizes  the  SNP  under  consideration 
by  referencing  the  dbSNP  identifier  (i.e.,  rs4081559)  and  provides  the  recommended  sequence  of  primers  for  PCR  and  pyrosequencing.  Middle  Panel.  The  location  of  SNP  and  primers  are 
indicated  in  blue  (query  SNP),  red  (pyrosequencing  primer),  and  yellow  (PCR  primers).  The  region  amplified  during  PCR  is  indicated  in  gold.  The  locations  of  nearby  SNPs  are  indicated 
(pink).  Bottom  Panel.  The  anticipated  pyrograms  from  each  genotype  are  shown.  Pyrograms  plot  the  nucleotide  dispensation  order  (x-axis)  versus  the  light  emitted  (y-axis)  during 
sequencing  by  synthesis  steps  of  pyrosequencing. 


predicted  stability  of  primer  annealing  is  monitored  by 
calculating  the  melting  temperature  (Tm);  sequence 
complexity  is  measured  by  maintaining  representative 
frequencies  of  each  nucleotide  residue  in  the  final  primer 
sequence.  Identification  of  nearby  polymorphic  sites  is 
performed  in  order  to  improve  the  design  of  primer  trios  for 
sequencing  as  well  as  to  enable  possible  multiplexing  of  assays 
for  genotyping  these  residues. 

The  software  generates  lists  of  candidate  forward  and  reverse 
PCR  primers  that  pass  the  individual  tests.  These 
oligonucleotide  sequences  are  then  compared  in  order  to 
obtain  compatible  primer  pairs  for  use  during  PCR 
amplification.  This  is  accomplished  by  analyzing  primers  for 
regions  of  reverse  complementary  sequence  that  may  initiate 
primer  dimer  amplification.  Final  optimization  of  PCR  primers 
is  accomplished  by  modifying  the  PCR  forward  primer  by 
addition  of  an  extra-genomic  nucleotide  motif  to  the  5’-end. 
This  results  in  suppression  of  competing  sequencing  reactions 
during  genotyping  analysis  that  may  occur  whenever  the 
template  strand  3 ’-end  is  capable  of  annealing  to  an  internal 
sequence  motif,  thus  repressing  initiation  of  intra-molecular 
sequencing.7  The  resulting  PCR  primers  are  returned  to  the 
user  as  oligonucleotide  sequences  recommended  for  laboratory 
testing  which  are  either  presented  graphically  or  in  a  tab- 
delimited  format  (Figure  1,  top  panel). 

Computer  software  for  designing  primers  for  use  during 
polymerase  chain  reaction  (PCR)  is  available  from  a  number  of 
sources  (Table  1).  A  principle  use  of  these  applications  is  to  aid 
assay  design  when  evaluating  the  association  between 
phenotypic  changes  and  the  inheritance  of  genetic  markers, 
such  as  single  nucleotide  polymorphisms  (SNPs)  and  nucleotide 
insertions  and  deletions.  Commercial  software  is  also  available 


for  generating  PCR  and  sequencing  primers  (Table  1).  For 
example,  the  manufacturer  of  the  pyrosequence  instrument 
Biotage  LLC  provides  assay  design  software  aimed  at  developing 
primer  sets  suitable  for  use  during  pyrosequencing.  These 
applications  provide  access  to  PCR  and  sequencing  primer 
design  algorithms  but  require  the  manual  input  of  sequence 
along  with  identification  of  the  polymorphic  residues  to  be 
evaluated.  For  genetic  studies  where  many  polymorphisms  are 
being  investigated,  such  as  in  the  investigation  of  a  set  of 
candidate  markers  for  association  with  a  disease,  this 
requirement  creates  a  bottleneck  during  assay  development. 
Thus,  improved  approaches  to  assay  design  that  provide 
integrated  genomic  sequence,  genetic  variant,  and  functional 
information  increase  the  efficiency  by  which  genotyping  assays 
can  be  developed  and  validated  by  laboratory  testing.8 

The  objective  of  the  SOP3  application  has  been  to  provide  a 
method  to  effectively  search  genomic  databases  for  SNPs  and 
insertion/deletion  polymorphisms  in  order  to  design 
genotyping  assays.  The  application  has  been  used  to  design 
primers  that  have  been  validated  for  a  variety  of  SNPs  within 
loci  suspected  of  correlating  with  risk  towards  developing 
disease  as  well  as  FILA  genotyping.912  Localizing  the  NCBI 
dbSNP  database  and  the  data  from  University  of  California 
Santa  Cruz  (UCSC)  GoldenPath  for  the  human  genome 
allowed  the  creation  of  structured  query  language  (SQL)  to 
present  genomic  variation  data  as  well  as  annotation  of 
functional  genomic  elements  in  a  customized  format.  Using 
the  SOP3  application  for  a  multi-loci  genetic  marker 
association  study  is  aided  by  the  use  of  the  integrated  Entrez 
Gene  annotation,  which  identifies  the  start  and  end  markers 
of  a  gene,  as  well  as  the  start  and  end  markers  of  exons,  introns, 
and  the  location  of  polymorphisms  associated  with  splice  site 
regions.  SOP3  designed  primer  trios  continue  to  be  evaluated 
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SCIENTIFIC  COMMUNICATIONS 


Website 

Web  Address 

Description 

Primer  Design  Software: 

Integrated  DNA  Technologies 

http://www.idtdna.com 

Supplier  of  oligonucleotides.  The  website  also  contains 
an  extensive  selection  of  scientific  tools  for  PCR  primer 
design. 

Primer3 

http://frodo.wi.mit.edu/cgi- 
bin/primer3/primer3 www.cgi 

Primer3  is  a  widely  used  program  for  designing  primers 
for  PCR. 

S0P3 

http://imgen.ccbb.pitt.edu/sop3 

Web-based  software  specifically  tailored  for  designing 
primers  for  pyrosequencing. 

Human  Genome  Resources: 

NCBI 

http://www.ncbi.nlm.nih.gov/ 

The  National  Center  for  Biotechnology  Information  (NCBI) 
is  a  national  resource  for  molecular  biology  information 
containing  public  databases  and  software  tools  for 
analyzing  genome  data. 

dbSNP 

http://www.ncbi.nlm.nih.gov/SNP/ 

The  dbSNP  database  serves  as  a  central  repository  for 
both  single  base  nucleotide  subsitutions  and  short 
deletion  and  insertion  polymorphisms. 

Entrez  Gene 

http://www.ncbi .  n  1  m .  n  i  h .  gov/entrez/ 
query.fcgi?db=gene 

Entrez  Gene  is  a  searchable  database  of  genes  maintained 
by  the  National  Center  for  Biotechnology  Information. 

HapMap 

http://www.hapmap.org/ 

The  HapMap  Project  is  an  effort  to  identify  and  catalog 
genetic  similarities  and  differences  in  human  beings. 

Pyrosequencing  Resources: 

Biotage,  AB 

http://www.biotage.com 

Leading  supplier  of  pyrosequencing  reagents  &  instruments. 

Table  1.  Websites  and  Internet  Resources  Related  to  Pyrosequencing. 


in  the  laboratory  as  part  of  ongoing  research  into  the  genetics 
of  complex  diseases  using  human  cohorts  as  well  as  mouse 
model  systems.4,13 

Summary 

The  ability  to  generate  high  resolution  sequence  data  and 
throughput  of  hundreds  of  sequencing  reactions  per  day  make 
pyrosequencing  an  important  method  for  identification  of 
known  genetic  markers.  Based  on  finding  a  short  sequence  in 
each  allele  of  interest  that  occurs  only  once  within  the  PCR 
generated  fragment,  the  pyrosequencing  protocol  allowed 
rapid  identification  of  alleles  for  analysis  of  genetic 
predisposition  to  autoimmune  disease  as  well  as  for 
histocompatibility  typing.  Pyrosequencing  is  an  ideal  tool  for 
rapid  and  accurate  identification  of  genotypes  in  studies  of  the 
genetic  association  of  certain  HLA  alleles  with  the  likelihood 
of  developing  autoimmune  disease,  as  well  as  during  clinical 
genotyping  of  patients  and  donors  for  histocompatibility 
matching  during  transplantation. 
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Web-Based  Primer  Design  Software  for  Genome-Scale 
Genotyping  by  Pyrosequencing® 

Steven  Ringquist,  Christopher  Pecoraro,  Ying  Lu,  Alexis  Styche, 
William  A.  Rudert,  Panayiotis  V.  Benos,  and  Massimo  Trucco 


Summary 

Design  of  locus-specific  primers  for  use  during  genetic  analysis  requires  combining 
information  from  multiple  sources  and  can  be  a  time-consuming  process  when  validat¬ 
ing  large  numbers  of  assays.  Data  warehousing  of  genomic  DNA  sequences  and  genetic 
variations  when  coupled  with  software  applications  for  optimizing  the  generation  of  lo¬ 
cus-specific  primers  can  increase  the  efficiency  of  assay  development.  Selection  of  oli¬ 
gonucleotide  primers  for  PCR  and  Pyrosequencing®  (SOP3)  software  allows  user-directed 
queries  of  warehoused  data  collected  from  the  human  and  mouse  genome  sequencing 
projects.  The  software  automates  collection  of  DNA  sequence  flanking  single-nucle¬ 
otide  polymorphisms  (SNPs)  as  well  as  the  incorporation  of  locus-associated  functional 
information,  such  as  whether  the  SNP  occurs  in  an  exon,  intron,  or  untranslated  region. 
SOP3  software  accepts  three  types  of  user-directed  input  consisting  of  gene  locus  sym¬ 
bols,  SNP  reference  sequence  numbers,  or  chromosomal  physical  location.  For  human 
polymorphisms,  SOP3  incorporates  haplotype,  ethnicity,  and  SNP  validation  attributes. 
The  output  is  a  list  of  oligonucleotide  primers  recommended  for  Pyrosequencing-based 
typing  of  genetic  variations.  SOP3  is  available  at  the  Division  of  Immunogenetics  com¬ 
putational  server  found  at  http://imgen.ccbb.pitt.edu. 

Key  Words:  Bioinformatics;  genetics;  genomics;  genotyping;  sequencing. 

1.  Introduction 

Computer  software  for  designing  primers  for  use  during  PCR  are  available 
from  a  number  of  sources  (1-3).  A  principle  use  of  these  applications  is  to  aid 
assay  design  when  evaluating  the  association  between  phenotypical  changes 
and  the  inheritance  of  genetic  markers,  such  as  single-nucleotide  polymor¬ 
phisms  (SNPs)  including  nucleotide  insertions  and  deletions.  Commercial  soft¬ 
ware  is  also  available  for  generating  PCR  and  sequencing  primers  and  for  use 

From:  Methods  in  Molecular  Biology,  vol.  373:  Pyrosequencing®  Protocols 
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during  DNA  sequencing.  For  example,  the  manufacturer  of  the  Pyrosequencing® 
instrument  Biotage,  LLC  provides  assay  design  software  aimed  at  developing 
primer  sets  suitable  for  use  during  Pyrosequencing.  These  applications  provide 
access  to  PCR  and  sequencing  primer  design  algorithms,  but  require  user  input 
of  FASTA-formatted  sequences  along  with  identification  of  polymorphic  resi¬ 
dues  that  are  to  be  evaluated.  For  genetic  studies  where  many  polymorphisms 
are  being  investigated,  such  as  is  necessary  when  investigating  a  set  of  candi¬ 
date  markers  for  association  with  a  disease  phenotype,  this  requirement  creates 
a  bottleneck  during  assay  development.  Improved  approaches  to  assay  design 
that  provide  integrated  genomic  sequence,  genetic  variant,  and  functional  in¬ 
formation  increase  the  efficiency  by  which  genotyping  assays  can  be  devel¬ 
oped  and  validated  by  laboratory  testing  (4). 

SOP3  is  a  web-based  software  application  that  has  been  developed  for  use  in 
Pyrosequencing  assay  design  and  is  available  at  http://imgen.ccbb.pitt.edu  (5). 
The  computational  Webserver  warehouses  an  integrated  biological  database 
comprised  of  genetic  variants,  as  well  as  DNA  sequence  of  the  human  and 
mouse  genomes,  including  validation  of  human  SNPs  within  the  HapMap  com¬ 
parative  genome  project.  The  software  provides  a  batch  analysis  tool  that  ac¬ 
cepts  the  names  of  as  many  as  100  genetic  loci,  100  SNP  identifiers,  or  a  100-kb 
section  of  genomic  DNA  under  investigation.  The  application  returns  a  set  of 
primers  for  PCR  and  Pyrosequencing  for  all  corresponding  genetic  variants 
listed  in  dbSNP  along  with  flanking  genomic  DNA  sequences  contained  within 
the  human  and  mouse  genome  resources.  Local  warehousing  of  genomic  se¬ 
quences,  making  them  accessible  to  the  application,  enables  the  software  to 
provide  the  choice  of  locus-specific  primer  trios  for  PCR  and  Pyrosequencing 
for  use  in  rapid  genotype  analysis. 

2.  Materials 

2. 1.  System  Configuration 

1.  The  SOP3  application,  genomic  sequence,  and  genetic  variation  data  from  the 
human  and  mouse  genome  projects  are  warehoused  on  a  computational  Webserver 
purchased  from  @Xi  Computer  (San  Clemente,  CA).  The  software  application  is 
written  in  preprocessor  hypertext  protocol  v5.0.3  and  is  associated  with  a  MySQL 
4.1  database  developed  on  a  Linux  SuSE  Enterprise  Server  8  for  the  AMD64 
operating  system  with  Apache  web  server  v2.0.48.  The  computational  server  con¬ 
sists  of  dual  AMD  Opteron  246  64-bit  processors  with  1024  KB  Cache,  8192  MB 
random  access  memory,  and  four  250  GB  hard  drives. 

2.2.  Connecting  to  the  Webserver 

1 .  The  SOP3  website  can  be  accessed  over  the  internet  at  http://imgen.ccbb.pitt.edu. 
The  application  works  best  when  viewed  in  Internet  Explorer  v6  or  higher  but  can 
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also  be  viewed  in  Firefox  as  well  as  Netscape.  Internet  browsers  are  available  for 
download  from  http://www.microsoft.com/windows/ie,  http://browser.netscape.com, 
and  http://www.mozilla.org,  respectively. 

3.  Methods 

The  SOP3  application  has  been  designed  for  use  with  genotyping  protocols 
and  enables  locus-specific  PCR  amplification  of  candidate  SNPs  for  sequence 
analysis.  Filtering  of  genetic  variations  for  validated  and  FlapMap-associated 
SNPs  improves  the  efficiency  of  assay  development  for  studying  the  role  of 

_  DNA  polymorphisms  in  determining  the  phenotype  in  models  of  genetic  inher- 

table  i  itance  (Table  1).  An  advantage  of  the  SOP3  application  is  that  design  of 
genotyping  studies  can  be  developed  using  the  assay  design  algorithm  by  fil¬ 
tering  all  available  genetic  variants  for  those  specified  by  user-directed  query, 
such  as  specifying  human  or  mouse  and  using  only  validated  nucleotide  poly¬ 
morphisms  ( see  Note  1).  The  application  accepts  multiple  queries  of  gene  loci, 
SNPs,  as  well  as  chromosomal  regions  for  analysis.  SOP3  provides  multiple 
choices  for  selecting  the  most  suitable  genetic  markers  for  evaluation  during 
genotyping.  The  User  Guide  for  operation  of  SOP3  software  is  available  online 
at  http://imgen.ccbb.pitt.edu/sop3/userguide. 

3. 1.  Basic  Functions 

1.  Searching  human  or  mouse  genetic  variations.  The  web  interface  to  the  SOP3 
application  allows  the  user  to  specify  whether  to  search  for  SNPs  within  the  hu¬ 
man  or  mouse  genome.  The  default  setting  is  for  human  genetic  variants  and  is 
indicated  on  the  website  by  the  filled  in  button  adjacent  to  the  label  for  “Human.” 
To  specify  searching  of  the  mouse  database  the  user  should  click  the  button  next 
to  “Mouse”  provided  on  the  website  (Fig.  1). 

2.  Searching  by  locus  name.  To  query  the  SOP3  database  for  SNPs  associated  with 
up  to  100  locus  symbols  the  user  can  enter  the  symbol  or  a  list  of  locus  symbols 
into  the  textbox.  Locus  symbols  can  be  typed  directly  into  the  text  box,  entered 
using  the  copy  and  paste  function  on  the  user’ s  computer,  or  by  uploading  from  a 
text  file.  Queries  are  restricted  to  no  more  than  100  entries  per  batch.  Use  the 
check  boxes  provided  to  specify  whether  the  query  is  for  human  or  mouse  ge¬ 
nomes.  Click  the  “Search”  button  to  activate  the  application.  Results  will  appear 
at  the  bottom  of  the  page.  Selecting  the  “Clear”  button  will  refresh  the  applica¬ 
tion  to  the  default  settings. 

3.  Searching  by  SNP  identifier.  To  query  the  SOP3  database  by  SNP  identifier  the 
user  can  enter  either  a  single  reference  sequence  number  or  a  list  of  polymor¬ 
phism  reference  sequence  numbers.  Queries  are  restricted  to  no  more  than  100 
entries  per  batch.  Use  the  check  boxes  provided  to  specify  whether  the  query  is 
for  human  or  mouse  genomes.  Click  the  “Search”  button  to  activate  the  applica¬ 
tion.  Results  will  appear  at  the  bottom  of  the  page.  Selecting  the  “Clear”  button 
will  refresh  the  application  to  the  default  settings. 


FIG  1 


Table  1 

Classification  of  Human  and  Mouse  Genetic  Variants 
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Unknown  479  NA  Samples  with  unknown  geographic  provinces  that  are  not  global 

in  nature. 

W.  Africa  174  NA  Sub-Saharan  nations  bordering  the  Atlantic  north  of  the  Congo 

River  and  central/southern  Atlantic  nations. 
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Primer  Design  Software  3 1 

Table  2 


Maximum  Allowable  Upper  Limit  for  Screening  Chromosome  Range 


Human  chromosome  Maximum  range 

Mouse  chromosome 

Maximum  range 

1 

245,442,500 

1 

195,198,653 

2 

242,818,021 

2 

181,685,801 

3 

199,450,386 

3 

160,571,871 

4 

191,400,945 

4 

154,132,574 

5 

180,837,593 

5 

149,217,787 

6 

170,972,421 

6 

149,553,910 

7 

206,556,958 

7 

133,031,105 

8 

146,272,185 

8 

128,674,990 

9 

138,428,984 

9 

124,140,960 

10 

135,412,916 

10 

130,567,357 

11 

134,451,003 

11 

121,607,694 

12 

132,389,146 

12 

114,933,529 

13 

114,127,336 

13 

116,456,691 

14 

106,360,250 

14 

117,011,917 

15 

100,337,960 

15 

104,102,711 

16 

88,821,548 

16 

98,800,952 

17 

80,652,345 

17 

93,538,550 

18 

76,116,152 

18 

90,878,013 

19 

63,806,020 

19 

60,667,351 

20 

62,435,629 

X 

160,070,598 

21 

46,943,948 

Y 

47,759,179 

22 

49,534,318 

X 

154,823,225 

Y 

57,700,652 

Values  for  maximum  range  were  determined  using  the  highest  chromosomal  position  number 
associated  with  a  SNP  on  the  human  or  mouse  genomes  using  build  35  or  33,  respectively. 


4.  Searching  by  chromosomal  location  and  range.  To  query  the  SOP3  database  by 
chromosomal  range  use  the  check  boxes  to  specify  whether  the  search  will  focus 
on  human  or  mouse  genomes.  Next,  use  the  “Chromosome  Number”  dropdown 
menu  to  specify  which  chromosome  is  to  be  used.  Enter  lower  and  upper  limits 
for  the  nucleotide  positions  that  should  be  screened.  Queries  are  restricted  to 
regions  no  greater  than  100,000  nucleotides.  The  maximum  allowable  upper  limit 

for  screening  a  range  of  nucleotides  on  each  chromosome  corresponds  to  the  - 

length  of  the  chromosome  and  is  indicated  in  Table  2.  Click  the  “Search”  button  TABLE  2 
to  activate  the  application.  Results  will  appear  at  the  bottom  of  the  page.  Select¬ 
ing  the  “Clear”  button  will  refresh  the  application  to  the  default  settings. 

5.  Filtering  the  results  to  show  only  validated  SNPs.  Selection  of  the  “Validated 
SNPs”  search  filter  directs  the  application  to  screen  genetic  polymorphisms  for 
inclusion  in  the  list  of  validated  polymorphisms  provided  by  dbSNP.  There  are 
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currently  384,831  human  and  464,137  mouse  polymorphisms  associated  with  this 
attribute.  Validation  status  is  defined  as  a  genetic  variant  that  has  been  reported 
by  multiple  submissions,  is  linked  to  frequency  data  for  the  SNP  in  a  population, 
or  was  submitter  validated  by  an  updated  submission  from  the  original  report. 
Filtering  the  results  to  show  only  HapMap-genotyped  SNPs.  Selection  of  the 
“HapMap”  search  filter  directs  the  application  to  screen  genetic  polymorphisms 
for  those  validated  polymorphisms  that  are  also  included  in  the  International  HapMap 
Consortium  haplotyping  mapping  project  incorporated  into  the  dbSNP  database. 
There  are  currently  35,126  human  polymorphisms  associated  with  this  attribute. 
Filtering  the  results  by  limiting  the  population  criteria.  Selection  of  the  “Limit 
By  Population  Criteria”  filter  directs  the  application  to  screen  genetic  polymor¬ 
phisms  by  geographic  origin  of  the  sample.  Selection  of  more  than  one  popula¬ 
tion  filter  will  return  those  polymorphisms  that  meet  either  of  the  criteria  (i.e.,  a 
Boolean  OR  operation).  The  number  of  polymorphisms  associated  with  each 
population  group  is  indicated  on  the  application’s  homepage. 

Limiting  the  search  by  function  type.  Genetic  variants  may  be  filtered  by  whether 
they  are  near  a  gene  (locus  region),  in  a  UTR  (untranslated  region),  in  an  intron 
(intron),  or  in  a  splice  site  (splice  site).  If  the  variation  is  in  a  coding  region,  then 
the  functional  class  of  the  variation  depends  on  how  each  allele  may  affect  the 
translated  peptide  sequence,  e.g.,  nonsynonymous  change  or  synonymous  change. 
Selection  of  more  than  one  of  the  function-type  filters  will  return  those  polymor¬ 
phisms  that  meet  either  of  the  criteria  (i.e.,  a  Boolean  OR  operator).  A  definition 
of  these  filters  as  well  as  the  number  of  polymorphisms  associated  with  each 
function  type,  for  human  and  mouse,  is  indicated  in  Table  1. 

Initiating  design  of  primers  for  PCR  and  Pyrosequencing.  Selecting  the  “Search” 
button  will  activate  the  application.  Locus-specific  primers  for  PCR  and 
Pyrosequencing  will  be  returned  for  those  SNPs  that  meet  the  filtering  criteria 
selected  in  the  user  input  section  of  the  web  interface. 

Reporting  the  results.  After  submitting  a  search  the  application  will  present  the 
results  at  the  bottom  frame  of  the  website.  Each  report  consists  of  four  sections, 
attribute  bar,  candidate  primer  bar,  DNA  sequence  flanking  the  polymorphism, 
and  computer-simulated  Pyrosequencing  data  for  heterozygous  and  each  homozy¬ 
gous  genotype  (Fig.  2). 

The  locus  attribute  bar  provides  information  linked  to  the  polymorphism  consist¬ 
ing  of  reference  sequence  number,  identity  of  the  polymorphism,  locus  symbol, 
physical  location  on  the  chromosome,  heterozygosity  value,  and  function  at¬ 
tribute.  If  the  filters  for  “Validated  SNPs,”  “HapMap,”  or  “Limit  By  Population 
Criteria”  were  selected  that  information  will  also  appear  in  the  attribute  bar. 

The  candidate  primer  bar  contains  the  oligonucleotide  sequence  (written  5'  to  3') 
for  the  candidate  PCR  forward  and  reverse  primers  as  well  as  the  Pyrosequencing 
primer  ( see  Note  2).  The  nomenclature  used  for  designating  a  forward  PCR  primer 
is  that  it  is  on  the  same  side  of  the  SNP  as  the  Pyrosequencing  primer.  The  nomen¬ 
clature  for  designating  a  reverse  PCR  primer  is  that  it  is  on  the  opposite  side  of 
the  SNP  as  the  Pyrosequencing  primer.  The  reverse  PCR  primer  should  be  syn¬ 
thesized  with  a  5'  biotin-TEG  modification  during  the  Pyrosequencing  protocol. 
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13.  The  DNA  sequence  flanking  the  SNP  is  indicated  in  the  third  panel  of  the  results 
frame.  The  user  may  view  the  “Full”  sequence  that  was  evaluated,  consisting  of  the 
polymorphism  and  1000  flanking  residues.  The  DNA  sequence  may  also  be  viewed 
in  FASTA  format  by  selecting  the  “Text”  button.  The  option  for  viewing  the  flank¬ 
ing  sequence  as  “Small”  will  display  only  the  sequence  amplified  by  the  locus- 
specific  PCR  primers  ( see  Note  3).  The  color  code  in  the  bottom  frame  shows  PCR 
primers  indicated  in  yellow,  Pyrosequencing  primer  in  red,  the  SNP  is  colored 
blue,  the  amplified  region  in  orange,  and  nearby  polymorphisms  shown  in  gray. 
Lowercase  characters  indicate  the  location  of  repeat  masker-identified  genomic 
regions.  A  “Legend”  button  providing  a  key  to  the  color  coding  is  also  provided. 

14.  Simulated  Pyrosequencing  data  is  displayed  in  the  bottom  panel  of  each  result.  A 
simulated  Pyrogram  is  drawn  for  each  genotype,  i.e.,  heterozygous  and  both  ho¬ 
mozygous  genotypes.  A  nucleotide  dispensation  order  for  evaluating  these  data 
in  the  Pyrosequencing  software  is  illustrated  above  the  Pyrogram  charts.  Infor¬ 
mative  dispensations  for  distinguishing  the  genotypes  are  indicated  in  yellow 
( see  Note  4). 

15.  Minimization  and  maximization  of  the  input/output  frames  can  be  accomplished 
by  clicking  the  “Minimize/Maximize”  button  located  in  the  upper  right-hand  cor¬ 
ner  of  the  web  interface.  A  full-screen  view  of  the  results  of  the  SOP3  application 
is  available,  allowing  easier  browsing  of  the  results  frame. 

1 6.  Sorting  the  results  by  reference  SNP  number,  heterozygosity  value,  physical  position 
on  the  chromosome,  gene  name,  or  function  can  be  accomplished  using  the  “Sort 
By”  dropdown  menus.  The  “Sort  By”  function  is  located  in  the  horizontal  bar 
found  along  the  middle  of  the  web  interface.  Results  may  be  viewed  in  ascending 
“A”  or  descending  “v”  order.  Results  on  the  web  interface  are  presented  20  at  a 
time.  When  greater  than  20  results  are  returned  they  can  be  viewed  by  choosing  the 
“Page”  dropdown  menu.  The  total  number  of  results  and  pages  of  results  is  also 
indicated.  After  selection  of  functions  for  sorting  and  page  display  the  results  can 
be  refreshed  by  clicking  the  “Redraw”  button  located  on  the  middle  toolbar. 

17.  Results  from  the  SOP3  application  can  be  saved  in  an  Excel  formatted  text  file. 
Select  this  function  by  clicking  the  “Save  As.”  This  will  open  a  text  box  for 
designating  a  file  name  and  a  “Save”  button  for  uploading  the  results  to  the  SOP3 
server.  The  saved  file  can  be  viewed  by  selecting  the  “Open  Saved  Results”  but¬ 
ton  located  at  the  top  of  the  page.  Saved  files  are  listed  alphabetically  and  each 
file  contains  a  tab-delimited  list  of  PCR  and  sequencing  primers,  as  well  as  SNP 
attributes  such  as  reference  sequence  number,  heterozygosity  value,  and  function 
attribute.  Selection  of  the  “Save  All”  button  will  allow  all  the  results  to  be  trans¬ 
ferred  to  the  web  server.  Unselecting  of  the  “Save  All”  button  will  allow  only  the 
page  of  results  presently  displayed  in  the  bottom  frame  to  be  saved. 

18.  Selecting  the  “Clear”  button  located  in  the  upper  frame  of  the  web  interface  will 
refresh  the  application  to  the  default  settings. 

3.2.  Advanced  Options  for  PCR  Primer  Design 

1.  Selection  of  the  “Advanced  Options”  button  opens  a  list  of  PCR  and  sequencing 
primer  design  settings  used  in  the  assay  design  algorithm.  This  function  allows 
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the  user  to  specify  customized  settings  for  primer  selection  that  are  different  from 
the  default  settings.  Clicking  the  “Advanced  Options”  a  second  time  will  hide 
these  values.  Selecting  the  “Clear”  button  will  refresh  the  application  to  the  de¬ 
fault  settings. 

2.  The  display  for  PCR  primer  design  settings  allows  the  user  to  adjust  the  primer 
selection  criteria. 

3.  “Melting  Temperature  (Tm)  °C”  default  setting  is  60°C.  Tra  is  calculated  using  the 
relationship  (Tm  =  16.6*log[cation  concentration]  +  41*[fraction  of  GC]  +  81.5, 
where  the  concentration  of  cation  was  estimated  to  be  0.1  M )  described  by 
Schildkraut  and  Lifson  (6).  User-directed  changes  to  this  value  instruct  the  appli¬ 
cation  to  design  primers  to  the  indicated  minimum  Tm  value. 

4.  The  “Minimum  Foldover”  filter  is  set  at  a  default  of  five.  This  value  reflects  the 
minimum  allowable  number  of  contiguous  base-paired  residues  that  can  occur 
within  a  candidate  PCR  primer.  The  function  allows  user-directed  input  in  order 
to  minimize  the  formation  of  potential  primer  secondary  structure  that  can  inter¬ 
fere  with  PCR  efficiency. 

5.  The  “Unique  N’mer  Maximum  Length”  filter  default  setting  is  six  and  specifies 
the  maximum  number  of  3'-end  residues  that  meet  the  requirement  of  only  occur¬ 
ring  once  within  1000  nucleotides  flanking  either  side  of  the  polymorphism. 

6.  The  “A/T  Test”  filter  specifies  that  at  least  one  A  or  T  residue  will  occur  in  the 
last  three  3’  nucleotides  of  the  PCR  primer. 

7.  The  “Residue  Thresholds”  filter  specifies  the  frequency  range  for  each  A,  T,  C,  G 
nucleotide  residue  in  the  candidate  PCR  primer.  The  default  setting  for  this  filter 
is  minimum  14%  and  maximum  40%  nucleotide  composition. 

8.  The  “Flank  Length  (Maximum  1000)”  setting  default  setting  is  1000  and  denotes 
the  length  of  DNA  flanking  the  polymorphism  that  is  used  to  screen  for  suitable 
PCR  primers. 

9.  The  filter  “Use  Repeat  Masker”  indicates  whether  the  results  of  RepeatMasker 
(repeatmasker.org)  will  be  used  when  evaluating  the  DNA  sequence  flanking  the 
SNP.  When  selected,  residues  that  are  identified  as  being  included  in  a  repetitive 
region  will  be  masked  off  and  will  not  be  considered  when  making  the  PCR 
primer,  thus  generating  primers  that  will  not  anneal  to  regions  of  repetitive  DNA. 
Using  this  option  will  generally  decrease  the  number  of  primers  found,  but  will 
better  ensure  that  primers  do  not  anneal  to  multiple  sites. 

10.  The  filter  for  “PCR  Product  Size”  determines  the  range  of  length  of  an  acceptable 
amplified  PCR  product.  The  default  setting  for  this  filter  is  minimum  200  bp  and 
maximum  500  bp. 

3.3.  Advanced  Options  for  Pyrosequencing  Primer  Design 

1.  User-directed  changes  to  the  Pyrosequencing  primer  design  settings  are  accom¬ 
plished  using  the  three  interactive  boxes  available  on  the  lower  panel  of  the  “Ad¬ 
vanced  Options”  window. 

2.  The  “Melting  Temperature  (Tm)  °C”  default  setting  is  40°C.  Tm  is  calculated  us¬ 
ing  the  relationship  (Tm  =  16.6*log[cation  concentration]  +  41*[fraction  of  GC] 
+  81.5,  where  the  concentration  of  cation  was  estimated  to  be  0.1  M)  described 
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by  Schildkraut  and  Lifson  (6).  User-directed  changes  to  this  value  instruct  the 
application  to  design  each  sequencing  primer  to  the  indicated  minimum  Tm  value. 

3.  The  filter  for  “T+A  Percentage”  evaluates  the  design  of  the  candidate  sequencing 
primer  for  %  AT  content  ensuring  that  this  value  falls  within  the  specified  range. 
The  default  range  is  minimum  30%  and  maximum  65%. 

4.  The  interactive  box  for  designating  the  “Distance  From  SNP”  provides  a  tool  for 
specifying  the  minimum  number  of  bases  away  from  the  SNP  to  initiate  the  de¬ 
sign  of  the  Pyrosequencing  primer.  The  default  range  is  minimum  1  and  maxi¬ 
mum  30  nucleotides.  The  software  algorithm  is  designed  to  choose  the  closest 
Pyrosequencing  primer  to  the  SNP  but  with  the  range  of  distance  designated  by 
the  “Distance  From  SNP”  function. 

3.4.  Exploring  the  Website 

1 .  General  information  regarding  warehoused  genomic  and  genetic  variant  data  is 
indicated  in  the  masthead  of  the  web  interface.  For  example,  the  build  of  dbSNP 
and  human  and  mouse  DNA  sequences  used  for  v2  of  SOP3  are  build  123,  35,  and 
33,  respectively.  Updates  to  the  SOP3  database  are  performed  twice  a  year  and 
are  indicated  in  the  masthead. 

2.  The  “Find  Gene  Name”  button  opens  a  link  to  the  search  page  for  determining 
the  locus  symbol  for  either  human  or  mouse  genes  as  specified  by  the  user.  The 
“Fine  Gene  Name”  search  box  allows  simple  Boolean  searches.  It  allows  and,  or, 
not.  It  uses  to  indicate  a  wildcard  character  and  quotation  marks  to  designate 
a  phrase.  Queries  are  submitted  using  the  “Submit  Query”  button. 

3.  The  “Open  Saved  Results”  button  opens  a  link  to  the  list  of  saved  files  available 
on  the  SOP3  web  server.  Selection  of  a  file  provides  a  tab-delimited  document 
consisting  of  the  locus  symbol,  reference  sequence  number,  allele,  heterzygosity 
value,  chromosome  location,  primer  sequences,  and  size  of  expected  PCR  prod¬ 
uct.  The  tab-delimited  document  can  be  saved  to  the  user’s  computer  and  opened 
in  Microsoft  Excel. 

4.  The  “Print”  button  provides  a  link  to  the  user’s  print  command  for  use  in  obtain¬ 
ing  a  hardcopy  of  the  results. 

5.  The  “Upload”  button  opens  a  “Text  Box,”  “Browse”  button,  “Submit”  button, 
and  “Cancel”  button  for  uploading  to  the  web  server  a  text  file  containing  a  batch 
of  locus  symbols  or  reference  sequence  numbers  from  a  file  located  on  the  user’s 
computer.  Selection  of  the  “Browse”  button  prompts  the  user  to  indicate  the  loca¬ 
tion  of  a  carriage  return  delimited  text  file.  Selection  of  the  “Submit”  button  up¬ 
loads  the  query  batch  to  “Search  By  SNP  ID  Or  Locus  Name,”  i.e.,  the 
application’s  main  entry  box.  The  application  is  initiated  by  selecting  the 
“Search”  button.  Selection  of  the  “Cancel”  button  removes  the  upload  feature 
from  the  web  interface  and  allows  the  user  to  query  genomic  SNPs  using  the 
basic  functions  available  on  the  website. 

6.  The  “View  Validated  Primers”  button  opens  a  link  to  the  primer  trios  that  have 
been  made  available  for  general  use.  They  are  listed  in  a  tabulated  format  and 
include  reference  sequence,  locus,  forward  PCR  primer,  reverse  PCR  primer, 
Pyrosequencing  primer,  and  PCR  product  length. 
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3.5.  Online  User  Guide 

1 .  The  “Guide  Me”  button  opens  a  link  to  the  online  user  guide  for  the  SOP3  appli¬ 
cation.  The  user  guide  is  updated  along  with  changes  to  the  application  and  the 
warehoused  database  ( see  Note  5).  Explanations  of  the  choices  available  when 
using  the  application  are  indicated  in  alphabetical  order. 

2.  The  “Info”  buttons  open  information  links  for  selected  functions  available  on  the 
web  interface.  For  example,  selection  of  the  “Info”  button  next  to  the  “Limit  By 
Function  Type”  heading  opens  a  new  browser  window  to  the  online  user  guide 
describing  the  use  of  this  filter  when  selecting  SNPs  for  genetic  analysis. 


4.  Notes 


1 .  Software  applications  for  primer  design  are  available  from  several  sources.  Com¬ 
monly  used  web-based  applications  for  designing  PCR  primers  are  available  at 
the  Primer3  (http://www.primer3.org)  and  IDT  Corporation  (http:// 
www.idtdna.com)  websites  (1).  Both  applications  generate  oligonucleotide  se¬ 
quences  for  locus-specific  PCR  amplification.  They  are  limited,  however,  in  that 
they  do  not  provide  a  sequencing  primer  for  Pyrosequencing.  Another  applica¬ 
tion,  specific  for  Pyrosequencing,  is  Assay  Design  Software  available  commer¬ 
cially  from  Biotage,  LLC.  This  application  provides  access  to  a  primer  design 
algorithm  for  locus-specific  PCR  and  SNP-specific  Pyrosequencing.  The  appli¬ 
cation,  however,  requires  user  input  of  each  SNP  and  DNA  flanking  sequence. 
The  Biotage  Assay  Design  Software  can  be  operated  using  a  batch  mode  once  the 
sequences  are  arranged  in  a  text  file  on  the  user’s  computer.  These  applications 
and  the  SOP3  software  provide  complimentary  solutions  for  selecting  robust  prim¬ 
ers  for  use  during  pyrosequencing-based  typing  of  genetic  variants. 

2.  Addition  of  a  unique  nucleotide  motif  to  the  5'-end  of  the  forward  PCR  primer 
occurs  in  order  to  avoid  formation  of  secondary  structure  in  the  biotinylated  tem¬ 
plate  strand,  which  can  lead  to  competing  sequencing  signal  owing  to  self-prim¬ 
ing  during  Pyrosequencing  (7-9).  The  nucleotide  motif  is  chosen  from  a  list  of 
all  possible  nucleotide  combinations  that  do  not  occur  within  the  DNA  sequence 
flanking  the  SNP. 

3.  Validating  PCR  primers  designed  by  the  computer  application  should  be  per¬ 
formed  by  PCR  amplification  followed  by  analysis  of  the  product  with  agarose 
gel  electrophoresis.  Selection  of  primers  assumes  a  standard  PCR  condition  us¬ 
ing  an  annealing  temperature  of  60°C.  It  is  recommended  that,  when  possible, 
PCR  product  yield  be  examined  at  1  and  2  m M  MgCT  and  over  an  annealing 
temperature  range  from  54  to  65  °C  using  a  gradient  thermal  cycler  instrument. 
PCR  primers  are  considered  validated  if  they  result  in  a  single  amplified  DNA 
product  of  the  expected  size  as  visualized  by  agarose  gel  electrophoresis. 

4.  Validating  Pyrosequencing  primers  is  accomplished  by  performing  the 
Pyrosequencing  reaction  using  a  control  DNA  of  known  quality  and  combined 
with  a  set  of  negative  control  Pyrosequencing  reactions.  The  negative  control 
reactions  are  designed  to  allow  troubleshooting  of  the  sequencing  reaction  by 
analyzing  the  level  of  background  signal  associated  with  the  sequencing  primer, 
biotinylated  reverse  PCR  primer,  and  possible  alternative  PCR  products.  Recom- 
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mended  negative  control  reactions  are  as  follows:  (1)  PCR  product  without  the 
addition  of  Pyrosequencing  primer;  (2)  PCR  negative  control  reaction  with 
Pyrosequencing  primer;  (3)  PCR  negative  control  reaction  without 
Pyrosequencing  primer;  (4)  Biotinylated  reverse  PCR  primer  alone;  (5) 
Pyrosequencing  primer  alone;  (6)  Biotinylated  PCR  primer  with  Pyrosequencing 
primer.  Failure  to  positively  genotype  samples  using  SOP3  designed  primers  for 
PCR  and  Pyrosequencing  can  often  be  the  result  of  background  signal  associated 
with  the  negative  control  reactions.  Isolation  of  the  background  signal  can  be 
helpful  in  redesigning  assays  for  selection  of  new  primer  trio  sets. 

5.  Contact  information  for  web  server  administration.  The  SOP3  application  is  main¬ 
tained  by  the  Division  of  Immunogenetics  at  the  Children’s  Hospital  of  Pitts¬ 
burgh.  Inquiries  concerning  customized  searches  of  the  database  or  reports  of 
errors  in  the  application  should  be  addressed  to  Steven  Ringquist  (email: 
smr73@pitt.edu). 
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Complications  of  type  1  diabetes  (T1D),  even  with 
reduction  in  risk  as  a  result  of  better  glycemic  control, 
are  still  the  major  concern  for  pediatricians  dealing 
with  young  patients  who  during  their  lifetime  will 
eventually  have  to  face  impaired  vision,  reduced 
peripheral  sensitivity,  and  kidney  disease.  T1D  is  asso¬ 
ciated  with  increased  risk  of  nephropathy,  a  clinical 
syndrome  usually  accompanied  by  other  diabetic- 
related  complications  such  as  retinopathy  and  neu¬ 
ropathy.  Blood  pressure  elevation,  cerebrovascular 
disease  and  high  risk  of  cardiovascular  morbidity  and 
mortality  accompany  the  renal  symptoms,  completing 
the  picture  of  the  syndrome  (1).  An  estimated  20^10% 
of  T1D  patients  will  develop  diabetic  nephropathy 
(DN),  clinically  first  evidenced  by  microalbuminuria, 
during  their  lifetime.  If  untreated,  nearly  all  T1D 


patients  experiencing  microalbuminuria  will  progress 
to  overt  nephropathy,  evidenced  by  macroalbuminu¬ 
ria,  followed  by  declining  kidney  function,  and 
culminating  in  end-stage  renal  disease  (ESRD)  (2,  3). 
In  addition  to  frequent  morbidity  linked  to  treating 
complications  of  the  kidney  (e.g.,  dialysis  and  kidney 
transplantation),  patients  experience  a  high  rate  of 
mortality,  with  a  5-yr  survival  rate  of  21%  once  ESRD 
has  developed.  An  estimated  45%  of  new  patients 
requiring  dialysis  and  kidney  transplant  are  diabetic 
(T1D  and  T2D),  and  diabetes  is  the  most  common 
and  rapidly  increasing  cause  of  ESRD  in  the  US  and 
European  populations. 

The  prevalence  and  course  of  DN  are  similar  in 
T1D  and  T2D  patients  when  matched  for  duration  of 
the  disease  (4).  T2D  incidence  is  increasing  in  children 


307 


Pasquali  et  al. 


and  has  become  a  major  concern  for  pediatricians.  DN 
follows  an  established  natural  history.  The  cardinal 
clinical  feature  of  the  syndrome  is  the  progressive 
increase  in  urine  protein  excretion  rate  (2).  The  clinical 
course  starting  with  microalbuminuria  through  pro¬ 
teinuria  and  azotemia  culminates  with  ESRD.  Before 
the  onset  of  overt  nephropathy,  a  prolonged  period  of 
clinical  silence  hides  various  changes  in  renal  function 
such  as  hyperfiltration,  hyperinfusion,  and  increasing 
capillary  permeability  to  macromolecules. 

A  progressive  rise  in  arterial  blood  pressure  and 
albuminuria  accompanies  glomerular  filtration  rate 
(GFR)  decline.  T1D  patients  predominantly  develop 
diastolic  hypertension,  whereas  T2D  patients  typically 
manifest  systolic  hypertension  (1).  Microalbuminuria 
remains  the  best  predictor  of  DN  in  both  T1D  and 
T2D  patients  (5).  Risk  factors  correlated  with  disease 
progression  are  poor  glycemic  control,  hypertension, 
dyslipidemia,  elevated  serum  cholesterol,  and  smoking 
(6-8).  ESRD  is  the  major  cause  of  mortality  in  T1D 
patients  and  is  the  dominant  indicator  of  fatality  as 
a  result  of  cardiovascular  disease  (9,  10). 

Changes  in  kidney  structure  described  in  diabetic 
patients  (T1D  and  T2D)  include  nodular  glomerular 
sclerosis  (Kimmelstiel-Wilson  nodular  disease)  (11) 
and  a  diffuse,  generalized  process  of  mesangial  expan¬ 
sion  (diffuse  diabetic  glomerulosclerosis).  In  addi¬ 
tion,  ultrastructural  changes  such  as  thickening  of  the 
tubular  and  the  glomerular  basement  membrane 
(GBM),  arteriolar  hyalinosis  (afferent  and  efferent) 
and  increase  of  mesangial  matrix  component  have 
been  observed  in  both  T1D  and  T2D.  Changes  in  the 
GBM  and  the  mesangium  already  can  be  detected 
after  2  yr  of  diabetes  during  a  period  of  clinical  silence 
(12).  However,  in  some  T2D  patients,  a  more  hetero¬ 
geneous  renal  pathology  has  been  described  (13,  14). 
The  latter  includes  superimposed  changes  observed  in 
T2DM  such  as  chronic  vascular  (arteriosclerotic  type) 
and  tubulointerstitial  lesions  or  glomerular  changes 
unrelated  to  diabetes  such  as  proliferative  glomerulo¬ 
nephritis  or  membranous  nephropathy  (15).  The 
atypical  pattern  in  renal  pathology  described  for 
T2D  may  underline  the  interplay  of  different  factors 
acting  as  promoters  of  progression  in  the  disease  other 
than  hyperglycemia. 

Genetic  risk  of  DN  and  ESRD 

It  is  widely  accepted  that  poorly  controlled  blood 
glucose  while  closely  correlated  with  T1DN  is 
insufficient  to  fully  account  for  the  incidence  of 
kidney  disease  in  this  population.  Approximately  half 
of  the  patients  with  poor  glycemic  control  do  not 
develop  nephropathy  (16-18).  Epidemiologic  study  of 
the  disease  has  provided  evidence  supporting  the 
hypothesis  that  there  is  a  genetic  predisposition. 
Among  T1D  patients  who  have  healthy  kidney 


function  30  yr  after  onset  of  diabetes,  the  incidence 
of  T1DN  decreases,  even  among  individuals  experi¬ 
encing  retinopathy.  New  cases  of  T1DN  are  few 
among  individuals  with  long-lasting  T1D  (3,  19,  20). 
Studies  of  siblings  concordant  for  T1D  have  demon¬ 
strated  familial  clustering  of  DN.  The  increased 
familial  incidence  of  T1DN  allowed  the  estimation 
of  the  genetic  risk  ratio  for  a  sibling  ranging  between 
roughly  twofold  for  T1DN  and  threefold  for  T1D- 
ESRD  (21-24). 

T1DN  is  a  complex  genetic  disease  indicating  that 
synergy  among  several  genes,  with  varying  individual 
effects,  are  contributing  to  the  disease.  Models  of 
genetic  susceptibility  to  T1DN  are  based  primarily  on 
the  analysis  of  Rogus  et  al.  (25)  who  used  the  data 
reported  by  Quinn  et  al.  (23)  to  simulate  the  incidence 
of  T1DN  among  siblings.  Their  analysis  modeled  the 
occurrence  of  T1DN  assuming  co-occurrence  of 
71.5%  when  the  index  case  exhibited  T1DN  and 
25.4%  when  the  index  case  presented  T1D  but  with 
normal  kidney  function.  Comparison  of  these  obser¬ 
vations,  when  overlaid  onto  a  genetic  model  of  an 
autosomal  dominant  gene  with  20%  disease  allele 
frequency  and  100%  penetrance,  implied  a  36% 
lifetime  risk  of  developing  nephropathy  in  the 
individuals  with  T1D.  Risks  of  66.2  and  19.0%  were 
predicted  for  T1D  siblings  of  the  index  cases  with  and 
without  T1DN,  respectively.  In  contrast,  an  autoso¬ 
mal  recessive  model  assuming  60%  disease  allele 
frequency  in  the  Caucasian  population  also  accounted 
for  the  observed  lifetime  risk  of  T1DN  and  similar 
sibling  risks  of  64.0  and  23.3%  for  an  index  case  with 
siblings  who  are  T1DN  or  T1D,  respectively. 

Genome-wide  mapping  aimed  at  understanding  the 
basis  for  inherited  risk  of  diseases  coincident  with 
T1DN  (i.e.,  T1D  and  hypertension)  have  identified 
genetic  elements  consistent  with  their  etiology 
(Table  1).  For  example,  T1D  has  been  strongly  linked 
to  inheritance  of  histocompatibility  leukocyte  antigen 
(HLA)  class  II  DRB1  and  DQB1  loci  expressing 
a  non-aspartate  residue  at  amino  acid  57  (26,  27).  An 
estimated  95%  of  T1D  patients  have  inherited  at  least 
one  copy  of  the  DRB1*0301,DQB1*0201  or  the 
DRB1*04,DQB  1*0302  haplotype  (28).  Between  30 
and  40%  of  T1D  patients  are  heterozygous  for  the 
combined  haplotype  despite  the  expected  frequency  of 
2%  in  their  respective  populations  (29-31).  However, 
HLA  is  estimated  to  account  for  only  40%  of  the 
genetic  risk  of  developing  T1D  (32-34).  Genetic 
analyses  of  affected  individuals  have  identified  addi¬ 
tional  T1D  susceptibility  loci,  reproducibly  implicat¬ 
ing  chromosomal  regions  1  lp  1 5.5  (insulin-dependent 
diabetes  mellitus  [IDDM]2,  INS),  2q33  (IDDM12, 
CTLA4  region),  and  lpl3  (PTPN22)  (34-36). 

Comparison  of  biopsy  tissues  from  the  kidneys  of 
T1DN  patients  supports  the  hypothesis  that  familial 
factors  affect  severity  of  the  glomerular  lesion.  In  fact 
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Table  1.  Confirmed  genetic  elements  influencing  susceptibility  to  type  1  diabetes 


Locus 

symbol 

IDDM 

region 

Cytogenetic 

band 

Odds  ratio 

Sibling  risk 
ratio 

Description 

HLA-DQB1 

IDDM1 

6p21 .3 

95 

2. 5-3. 6 

HLA  class  II  locus  DQB1  accounts  for 
greater  than  40%  of  the  relative  genetic 
risk  associated  with  T1D 

INS 

IDDM2 

1 1  p15.5 

2.7 

1.29 

Association  between  a  VNTR  marker 
located  immediately  5'  of  the  insulin 
gene  locus 

CTLA4 

IDDM12 

2q33 

1.2 

1.01 

The  CTLA4  locus  along  with  ICOS 
and  CD28  are  each  strong  candidates 
for  IDDM12-associated  risk 

PTPN22 

1p13 

1.8 

1.23 

The  PTPN22  locus  encodes  a  tyrosine 
phosphatase  and  is  linked  with  an 
increased  risk  for  T1D  as  well  as  other 
autoimmune  diseases,  e.g.,  rheumatoid 
arthritis,  systemic  lupus  erythematosus, 
and  thyroiditis 

HLA,  histocompatibility  leukocyte  antigen;  ICOS,  inducible  t-cell  costimulator;  IDDM,  insulin  dependent  diabetes  mellitus; 
INS,  insulin;  T1D,  type  1  diabetes;  VNTR,  variable  number  of  tandem  repeat.  Data  are  summarized  from  genetic  linkage 
analysis  described  by  Onengut-Gumuscu  and  Concannon  (36)  and  Motzo  et  al.  (147). 


significant  similarities  between  glomerular  lesions 
were  identified  even  when  sibling  T1D  patients  were 
discordant  for  glycemic  control  (37).  The  correlation 
in  severity  of  glomerular  lesions  among  T1D  siblings 
independent  of  hemoglobin  A1C  (HbAlC)  levels  is 
consistent  with  an  underlying  genetic  predisposition. 
Development  of  lesions  in  normal  kidneys  trans¬ 
planted  into  diabetic  patients  has  been  described  in  a 
long-term  study  of  recipients  6-14  yr  after  trans¬ 
plantation.  Wide  variation  in  the  rate  of  development 
of  lesions  in  different  kidneys  was  observed  indepen¬ 
dently  from  the  history  of  blood  glucose  levels.  This 
suggests  that  in  addition  to  glycemic  control  per  se  risk 
factors  intrinsic  to  the  kidney  itself  are  present  (38,  39). 

In  addition  to  T1D,  hypertension  is  an  inheritable 
risk  factor  linked  to  development  of  kidney  disease. 


indicating  that  the  genetics  of  T1DN  and  essential 
hypertension  may  be  linked  (Tables  2  and  3).  By 
controlling  blood  volume,  the  kidney  is  responsible 
for  long-term  blood  pressure  control.  Impaired  kidney 
function  is  a  principal  component  underlying  the 
physiology  of  essential  hypertension  (40,  41).  Family- 
based  studies  have  reported  an  increased  incidence  of 
hypertension,  3CM10%  higher  in  patients  who  develop 
T1DN  (42,  43),  whereas  patients  with  uncomplicated 
T1D  exhibit  relatively  lower  blood  pressures  (41). 
Parents  of  T1DN  patients  frequently  exhibit  increased 
blood  pressure  when  compared  with  parents  of 
unaffected  T1D  patients,  suggesting  a  familial  basis 
linking  hypertension  with  impaired  kidney  function 
(43).  Studies  designed  to  deconstruct  the  role  of 
genetics  on  hypertension  have  indicated  that  different 


Table  2.  Genes  linked  to  Mendelian  hypertensive  disorders 


Locus  symbol(s) 

Mendelian  disorder 

Physiological  role 

Description 

HSB11B1 

Apparent  mineralocorticoid 
excess 

Salt-water  homeostasis 

1 1-beta-hydroxysteroid 
dehydrogenase,  type  1 
(autosomal  recessive  trait) 

CYP11B1, 

CYP11B2 

Glucocorticoid-remediable 

aldosteronism 

Salt-water  homeostasis 

Cytochrome  P450,  subfamily  XIB, 
polypeptide  1  and  cytochrome  P450, 
subfamily  XIB,  polypeptide  2 
(autosomal  dominant  trait) 

SCNN1B, 

SCNN1G 

Liddle  syndrome 

Salt-water  homeostasis 

Sodium  channel,  nonvoltage-gated  1, 
beta  subunit  and  sodium  channel, 
nonvoltage-gated  1,  gamma  subunit 
(autosomal  dominant  trait) 

NR3C2 

Mutations  in  mineralocorticoid 
receptor 

Salt-water  homeostasis 

Nuclear  receptor  subfamily  3, 
group  C,  member  2 
(autosomal  dominant  trait) 

WNK1 ,  WNK4 

Pseudohypoaldosteronism 
type  II 

Salt-water  homeostasis 

Protein  kinase,  lysine-deficient  1  and 
protein  kinase,  lysine-deficient 

4  (autosomal  dominant  trait) 

Data  are  summarized  from  Mullins  et  al.  (45),  Litton  et  al.  (119),  and  Cowley  (148). 
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Table  3.  Candidate  loci  influencing  susceptibility  to  essential  hypertension 


Locus  symbol 

Cytogenetic  band 

Physiological  role 

Description 

Candidate  genes  located  on  chromosome  1q 

AGT 

1q42-q43 

RAS 

Angiotensin  1 

AVPR1B 

1q32 

Salt-water  homeostasis 

Arginine  vasopressin  receptor  IB 

HSD11B1 

1q32-q41 

Steroid  metabolism 

1 1-beta-hydroxysteroid  dehydrogenase, 
type  1 

NPR1 

1q21-q22 

Salt-water  homeostasis 

Natriuretic  peptide  A  type  receptor 

REN  1q32 

Candidate  genes  located  on  chromosome  3q 

RAS 

Renin 

AGTR1 

3q21-q25 

RAS 

Angiotensin  II  receptor,  type  1 

CLCN2 

3q27-q28 

Salt-water  homeostasis 

Chloride  channel  2 

DRD3 

3q13.3 

adrenergic  system 

Dopamine  receptor  D3 

ECE2 

3q28-q29 

Endothelin  related 

Endothelin-converting  enzyme  2 

KNG1 

3q27 

Kallikrein— Kinin  system 

Kininogen  1 

MME 

3q25.1-q25.2 

Salt-water  homeostasis 

Membrane  metallo-endopeptidase 
(neutral  endopeptidase,  enkephalinase) 

RAS,  renin-angiotensin  system.  Data  are  summarized  from  DeWan  et  al.  (44),  Hamet  et  al.  (149),  and  Jeunemaitre  et  al.  (150). 


genes  may  contribute  to  increased  incidence  of 
hypertension  in  different  ethnic  groups  (44,  45). 
Genome-wide  scans  have  implicated  different  regions 
of  chromosome  3  in  Caucasian  and  African-American 
populations  as  well  as  genetic  markers  on  chromo¬ 
some  1  and  6  among  Caucasians  (43,  45,  46)  (Table  3). 
Improved  understanding  of  the  inherited  basis  of 
T1DN  may  lead  to  deeper  understanding  of  the 
molecular  mechanisms  underlying  the  disease,  leading 
to  improved  therapeutic  intervention  as  well  as 
improved  understanding  of  why  the  majority  of  T1D 
patients  maintain  normal  kidney  function  while 
a  substantial  minority  go  on  to  exhibit  T1DN. 

A  number  of  genome -wide  linkage  scans  have  been 
performed  in  order  to  evaluate  the  genetics  underlying 
impaired  kidney  phenotype.  The  first  genome  scans 
were  performed  in  a  Pima  Indian  cohort  with  T2D 
and  proteinuria.  Genetic  analysis  of  sib  pairs  identi¬ 
fied  significant  linkage  on  chromosome  7.  Additional 
loci  were  identified  on  chromosomes  3q  and  9. 
Genome  scans  performed  in  an  African-American 
cohort  affected  by  T2D  and  ESRD  revealed  evidence 
for  susceptibility  on  chromosomes  3q,  7p,  and  18q 
(Table  4).  However,  as  these  studies  were  performed 
using  DNA  obtained  from  affected  sib  pairs  concor¬ 
dant  for  both  T2D  and  proteinuria  or  ESRD,  positive 
signals  may  have  cosegregated  with  T2D  or  nephrop¬ 
athy.  Following  nephropathy  only,  a  study  performed 
using  a  cohort  of  18  large  Turkish  families  with  T2D 
and  proteinuria  showed  strong  linkage  with  chromo¬ 
some  18q.  A  recent  genome-wide  association  study 
has  been  performed  genotyping  greater  than  80  000 
single  nucleotide  polymorphisms  (SNPs)  using  a 
cohort  of  T2DN  (case)  and  T2D  (control)  patients, 
identifying  genetic  association  with  the  engulfment 
and  cell  motility  1  gene  (ELM 01)  and  a  p  value  of  less 
than  8  X  10-6  on  chromosome  7p  (47).  Osterholm 
et  al.  (48)  have  recently  performed  a  genome-wide 


linkage  analysis  using  T1D  sib  pairs  discordant  for 
nephropathy.  The  study  identified  five  genomic 
regions  with  possible  linkage  to  T1DN,  implicating 
chromosomes  3q,  4p,  9p,  16q,  and  22p.  The  overlap 
between  chromosome  3q  identified  from  this  study 
and  those  performed  with  T2DN  cohorts  suggests 
a  possible  area  of  focus  for  future  genetic  analyses. 

The  complex  etiology  of  DN  reflects  the  intricate 
nature  of  the  disease,  requiring  a  combination  of  alleles 
of  several  genes  in  addition  to  exposure  to  environ¬ 
mental  factors.  The  role  of  shared  environmental  risk 
factors  has  been  extensively  evaluated,  and  while  their 
exact  nature  is  unclear,  it  is  likely  that  similar  lifestyles 
such  as  eating  and  smoking  habits  are  more  prevalent 
among  siblings  compared  with  unrelated  T1D  patients. 
Glycemic  control  has  been  proposed  to  be  a  risk  factor 
for  nephropathy  (49).  Studies  on  diabetic  twins  have 
indicated  that  HbAlC  levels  are  at  least  partially 
genetically  determined,  accounting  for  greater  than 
60%  of  the  population  variance  of  glycosylated  hemo¬ 
globin  (50).  Individuals  with  good  glycemic  control, 
whether  through  genetics  or  as  a  result  of  rigorous 
testing  and  adjustment  of  insulin  doses  to  control  blood 
glucose  levels,  are  able  to  reduce  their  risk  of  de¬ 
veloping  diabetic  complications,  including  T1DN  as 
was  shown  in  the  Diabetes  Control  and  Complica¬ 
tions  Trial  (DCCT),  as  explained  later  in  this  text. 

Risk  factors  associated  with  the  progression 
of  T1  DN 

What  factors  contribute  to  onset  of  nephropathy  as 
a  complication  in  T1D  patients?  Early  age  of  T1D 
onset  may  be  linked  to  the  increased  risk  of  T1DN 
(51),  but  this  conclusion  has  been  controversial  (52, 
53).  The  observation  that  the  risk  for  T1DN  reaches 
a  peak  incidence  at  ages  25-29  yr  and  occurs  equally  in 
both  males  and  females  may  indicate  that  the  triggering 
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Table  4.  Candidate  loci  influencing  susceptibility  to  diabetic  nephropathy 


Locus  symbol 

Cytogenetic  band 

Physiological  role 

Description 

Candidate  genes 

located  on  chromosome 

3q 

NCK1 

3q21 

Growth  factor/cytokine/signal 
transduction 

NCK  adaptor  protein  1 

RAB7 

3q21 .3 

GTP-binding  protein  related 

RAB7,  member  RAS 
oncogene  family 

SLC2A2 

3q26.1-q26.3 

Carbohydrate  transport/metabolism 

GLUT2  (solute  carrier  family  2 
(facilitated  glucose  transporter), 
member  2 

SUCNR1 

3q24-q25.1 

Metabolism 

Succinate  receptor  1 

Candidate 

genes 

located  on  chromosome 

4p 

ADD1 

4p16.3 

Cytoskeletal  component 

Adducin  1 

Candidate 

genes 

located  on  chromosome 

7p 

ELMOI 

7p14.2 

Apoptosis  related 

Engulfment  and  cell  motility  1 

IGFBP1 

7p14-p12 

Growth  factor/cytokine/signal 
transduction 

Insulin-like  growth  factor-binding 
protein  1 

IL6 

7p21 

Growth  factor/cytokine/signal 
transduction 

Interleukin  6 

NXPH1 

7p22 

Cell  adhesion 

Neurexophilin  1 

RAC1 

7p22 

GTP-binding  protein  related 

Ras-related  C3  botulinum  toxin 
substrate  1  (rho  family,  small 
GTP-binding  protein  Rac  1) 

Candidate 

genes 

located  on  chromosome 

16q 

BCAR1 

16q23.1 

Cell  adhesion 

Breast  cancer  anti-estrogen 
resistance  1 

CDH1 

16q22.1 

Tight  junction  component 

Cadherin  1,  type  1, 

E-cadherin  (epithelial) 

CDH3 

16q22.1 

Tight  junction  component 

Cadherin  3,  type  1,  P-cadherin 
(placental) 

CETP 

16q21 

Fatty  acid/cholesterol 
transport/metabolism 

Cholesteryl  ester  transfer  protein, 
plasma 

CYBA 

16q24 

Reactive  oxygen 
species  metabolism 

Cytochrome  b(-245),  alpha  subunit 

MAPK3 

1 6q  1 1 .2 

Growth  factor/cytokine/signal 
transduction 

Mitogen-activated  protein  kinase  3 

MMP2 

1 6q  1 3 

Extracellular  matrix  component 

Matrix  metalloproteinase  2 

Candidate 

genes 

located  on  chromosome 

18q 

BCL2 

18q21 .3 

Apoptosis  related 

B-cell  CLL/lymphoma  2 

DSC2 

1 8q  12.1 

Cell  adhesion 

Desmocollin  2 

SERPINB7 

18q21 .33 

Extracellular  matrix  component 

Serpin  peptidase  inhibitor, 
cladeb(ovalbumin),  member  7 

SMAD7 

18q21,1 

Growth  factor/cytokine/signal 
transduction 

SMAD  family  member  7 

CLL,  chronic  lymphocytic  leukemia;  GLUT,  glucose  transporter;  RAB,  RAS-associated  protein;  RAS,  renin-angiotensin 
system;  SMAD,  homolog  of  mothers  against  decapentaplegic.  Data  are  summarized  from  Shimazaki  et  al.  (47), 
Osterholm  et  al.  (48),  He  et  al.  (143),  and  Ewens  et  al.  (90). 


event(s)  may  be  independent  of  differences  in  T1D 
onset.  The  triggering  event  may,  however,  be  linked  to 
puberty  and  the  beginning  of  adolescence.  Puberty- 
associated  deterioration  in  glycemic  control,  hormonal 
changes,  as  well  as  psychosocial  pressures  have  come 
under  increased  scrutiny  for  their  influence  in  patients 
who  fail  to  maintain  good  blood  glucose  control  (54). 
Glycemic  control  often  deteriorates  during  adolescence 
in  individuals  with  T1D  and  may  result  from  develop¬ 
mental  changes,  especially  increased  growth  hormone 
secretion  during  puberty,  as  well  as  the  transition  to 
more  autonomous  lifestyle  typified  by  adulthood.  In 
some  female  patients,  there  is  also  the  emergence  of 
eating  disorders  and  the  misuse  of  insulin  as  a  means  to 
control  weight  gain  (54,  55). 


Constitutional  factors 

Predisposition  to  the  development  of  DN  can  be 
because  of  genetic  susceptibility  as  well  as  factors 
operating  in  utero.  Intra-uterine  malnutrition  has  been 
associated  with  a  reduced  number  of  nephrons  (56), 
increasing  the  risk  of  kidney  diseases  and  hypertension 
in  adult  life  (57).  Exposure  to  unfavorable  intra-uterine 
environment  leading  to  intra-uterine  growth  retarda¬ 
tion  (IUGR)  was  proposed  as  a  risk  factor  for  DN 
progression.  This  conclusion  follows  from  the  hypoth¬ 
esis  that  reduced  nephron  number  observed  in  IUGR 
patients  (58)  may  lead  to  reduced  functional  reserve 
in  those  kidneys  subjected  to  potentially  toxic  agents 
(e.g.,  hyperglycemia  or  smoking).  Rossing  et  al.  (59) 
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observed  an  increased  risk  of  DN  in  women  with 
IUGR  and  T1D.  This  was  confirmed  for  both  sexes  in 
a  cohort  of  Pima  Indian  type  2  diabetic  patients  (60).  In 
contrast,  the  conclusion  failed  to  be  reproduced 
according  to  observations  reported  by  Eshoj  et  al. 
(61 )  and  in  a  recent  study  on  Finnish  T1D  patients  (62). 
Contradictory  results  from  different  studies  have  not 
yet  clarified  if  there  is  a  correlation  between  IUGR  and 
DN;  however,  individuals  with  essential  hypertension 
were  shown  to  have  a  markedly  reduced  number  of 
nephrons  in  addition  to  an  inverse  correlation  between 
birth  weight  and  adult  systolic  blood  pressure  (63).  It  is 
possible  that  intra-uterine  factors  may  have  a  direct 
effect  on  DN  progression  later  in  life  although  the 
effect  may  be  indirect,  e.g.,  via  hypertension. 

Hyperglycemia 

Rates  of  decline  in  GFR  have  been  linked  in  T1D 
patients  to  increased  levels  of  blood  glucose  and  occur 
at  a  rate  of  roughly  10-14  ml/min/yr  in  patients  with 
persistent  albuminuria  (64).  This  observation  has  been 
confirmed  in  several  independent  studies,  indicating 
that  HbAlC  levels  are  predictive  of  the  risk  of 
subsequent  onset  of  ESRD.  In  the  absence  of 
therapeutic  intervention,  the  majority  of  patients  with 
sustained  microalbuminuria  progress  to  overt 
nephropathy  over  a  period  of  10-15  yr  (65-69). 

Control  of  blood  glucose  levels  is  critical  to  the 
development  of  the  characteristic  complications  of 
diabetes,  including  DN,  as  has  been  demonstrated  by 
the  outcome  of  the  DCCT.  The  DCCT  was  carried 
out  from  1983  to  1993  and  assessed  the  relative  effects 
of  intensive  and  conventional  insulin  therapy  treat¬ 
ments  on  preventing  development  and  progression  of 
complications  (16).  Enrolling  greater  than  1400  adult 
patients  with  T1D,  the  DCCT  showed  that  HbAlC 
levels  could  be  reduced  from  9.1%  to  7.2%  with 
a  corresponding  reduction  in  progression  of  renal 
complications  (16).  At  the  end  of  the  study,  the 
incidence  of  microalbuminuria  was  reduced  by  34%  in 
a  cohort  in  which  no  evidence  of  retinopathy  or 
nephropathy  was  present  at  baseline.  In  a  second 
cohort,  mild  and  early  complications  were  present  and 
the  DCCT  reported  a  43%  reduction  in  the  incidence 
of  microalbuminuria.  The  success  of  intensive  insulin 
therapy  for  the  treatment  of  diabetes  was  confirmed 
by  a  follow-up  study  showing  persistent  long-term 
reduction  in  the  incidence  of  T1DN  (70). 

Compliance  with  the  intensive  insulin  therapy 
regime  2  yr  after  conclusion  of  the  study  indicated 
that  95%  of  the  intensive  therapy  group  reported 
following  intensive  therapy  regime  and  70%  of  the 
conventional  treatment  group  now  reported  following 
intensive  therapy  regime.  This  means  that  30%  of  the 
conventional  treatment  patients  failed  to  adopt 
intensive  therapy  despite  knowledge  of  the  results. 


Comparison  of  HbAlC  levels  after  2  yr  post-DCCT 
indicated  that  the  intensive  therapy  group  experienced 
HbAlC  levels  of  7.2%  during  DCCT  and  7.9%  2  yr 
after  conclusion  of  the  trial.  Of  these  patients,  46% 
continued  to  monitor  blood  glucose  four  or  more 
times  a  day.  Patients  from  the  conventional  treatment 
group  who  converted  to  intensive  therapy  demon¬ 
strated  HbAlC  levels  of  8.1%  post-DCCT.  Of  these 
patients,  36%  reported  monitoring  blood  glucose  four 
or  more  times  a  day.  The  benefits  of  intensive  therapy 
5  yr  post-DCCT  were  disappointing  in  that  both 
cohorts  reported  average  HbAlC  levels  exceeding  8%. 
One  conclusion  of  the  study  is  that  in  the  absence  of 
a  strong  research  motivation  and  monthly  visits  with 
a  health  care  provider,  the  degree  of  intensive 
treatment  shown  in  the  DCCT  was  not  maintained. 
Moreover,  the  availability  of  health  insurance  corre¬ 
sponded  to  success  as  measured  by  reduction  of 
HbAlC  levels.  Patients  receiving  intensive  therapy 
5  yr  post-DCCT  without  health  insurance  maintained 
an  average  HbAlC  of  8.6%,  while  patients  with  health 
insurance  maintained  an  average  HbAlC  of  8.1%. 

Unfortunately,  patients  participating  in  intensive 
insulin  therapy  treatments  for  diabetes  are  more  likely 
to  experience  severe  adverse  effects  resulting  from 
frequent  injections  of  insulin.  Severe  hypoglycemic 
events  occurred  nearly  threefold  more  often  in  patients 
enrolled  in  the  intensive  therapy  regime  and  were 
characterized  as  requiring  assistance  of  another  person. 
This  precluded  the  enrollment  of  children  into  the 
study.  Severe  hypoglycemic  episodes  occurred  in  25% 
of  the  total  number  of  reported  cases  of  hypoglycemia 
and  were  manifested  by  coma  or  convulsions.  Of  the 
adult  patients  receiving  intensive  insulin  therapy,  22% 
experienced  at  least  five  episodes  of  severe  hypoglyce¬ 
mia  vs.  4%  of  conventionally  treated  subjects.  Life- 
threatening  complications  of  hypoglycemia  are  not 
uncommon,  limiting  the  availability  of  intensive  insulin 
therapy  as  a  means  to  prevent  onset  of  DN. 

Hypertension 

High  blood  pressure  is  prevalent  in  families  in  which 
T1DN  is  present  compared  with  T1D  families  without 
DN  (43).  Hypertension-associated  microalbuminuria 
is  a  predictive  indicator  of  overt  nephropathy  in 
T1D  patients  (71).  While  normalization  of  blood 
glucose  levels  has  been  linked  to  healing  of  microvas- 
cular  lesions  in  the  kidney,  treatment  of  hypertension 
has  resulted  in  improved  renal  outcome  by  preventing 
glomerular  damage.  Clinical  studies  have  indicated 
a  decline  in  the  incidence  of  nephropathy  in  T1D 
patients  (72-74).  However,  many  T1D  subjects  only 
attain  HbAlC  levels  of  8.5%  well  in  excess  of  the 
recommended  levels  of  blood  glucose.  Regression  of 
microalbuminuria  has  been  observed  upon  blood 
pressure  control,  and  remission  of  overt  nephropathy 
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has  been  observed  in  T1D  patients  treated  for 
hypertension  (5,  75). 

There  is  an  exponential  impact  from  hypertension 
on  increasing  risk  for  renal  disease  (76).  Maintaining 
a  sustained  reduction  in  blood  pressure  is  the  single 
most  influential  intervention  for  reducing  progression 
of  nephropathy  in  T1D  as  well  as  T2D  patients. 
Studies  of  initially  normotensive  T2D  subjects  without 
renal  disease  who  subsequently  developed  and  were 
treated  for  hypertension  indicated  that  patients  with 
blood  pressure  less  than  130/80  mm  Hg  rarely 
developed  nephropathy.  In  contrast,  diabetic  patients 
with  blood  pressure  in  the  range  of  130/80  to  140/ 
90  mm  Hg  exhibit  a  significant  decline  in  GFR 
coincident  with  30%  of  patients  progressing  to  micro¬ 
albuminuria  within  12-15  yr  (77).  Improvements 
in  blood  pressure  treatment  and  glycemic  control 
along  with  decreased  incidence  of  smoking  lead  to 
improved  outcomes  for  T1D  patients. 

The  kidney  structures  implicated  in  the 
pathogenesis  of  the  disease 

The  glomerular  filtration  barrier  in  T1DN 

A  thorough  understanding  of  the  molecular  properties 
of  the  glomerular  filtration  barrier  (GFB)  is  crucial  to 
comprehension  of  the  pathology  of  proteinuria 
encountered  in  T1DN.  Structurally,  the  GFB  is 
composed  of  three  layers  that  mediate  the  process  of 
ultrafiltration  of  blood  flowing  through  anastomosing 
capillary  loops  leading  to  the  formation  of  the  primary 
urine  in  the  Bowman's  capsule  and  finally  into  the 
tubular  system  (Fig.  1).  The  first  layer  is  a  highly 
fenestrated  vascular  endothelium  that  covers  the  wall 
of  the  capillaries.  It  has  been  suggested  that  a  protein 

See  enlargement 

Podocyte  Foot  process  in  Fig. 2 


♦  ♦  * 

Pre-urine 

Fig.  1.  The  GFB.  Ultrafiltration  occurs  when  the  blood  flows 
through  the  glomerular  capillaries.  The  GFB  comprises  the 
fenestrated  endothelial  cell  layer  (red)  and  the  glomerular  capillary 
basement  membrane  (gray).  The  podocytes  (yellow)  loop  to  the 
capillaries  with  their  pedicules  or  foot  processes.  A  protein  complex 
forming  the  slit  diaphragm  covers  the  space  between  the  podocytes 
processes.  GFB,  glomerular  filtration  barrier. 


filtration  role  of  the  endothelium  may  be  critical  to 
proper  kidney  function  because  of  the  barrier  prop¬ 
erties  of  glycocalyx  coating  of  the  endothelial  cells  and 
filling  in  of  the  intracellular  space  (78).  The  second 
layer  is  the  GBM,  forming  the  boundary  between 
blood  and  urine  primarily  composed  of  an  intercon¬ 
necting  network  of  type  IV  collagen  (alpha- 1  and  -2 
during  fetal  development  and  alpha-3,  -4,  and  -5  in  the 
adult)  and  laminin- 1  (consisting  of  alpha- 1 /beta- 1/ 
gamma- 1  chains  in  the  fetus)  and  laminin- 11  (consist¬ 
ing  of  alpha- 5/beta-2/gamma-l  chains  in  the  adult). 
The  third  layer  is  composed  of  highly  specialized  cells, 
the  podocytes  whose  cytoplasm  extends  into  interdig- 
itating  pedicles  or  foot  processes.  The  area  between 
the  podocyte  foot  processes  is  the  slit  diaphragm 
composed  of  a  number  of  cell  surface  proteins 
including  nephrin,  neph  1,  and  P-cadherin.  Under¬ 
standing  GFB  function  has  centered  on  the  role  of  the 
slit  diaphragm  formed  by  these  proteins,  serving  as  the 
ultimate  layer  ensuring  that  large  macromolecules 
such  as  serum  albumin  and  gamma  globulin  remain  in 
the  bloodstream  while  allowing  small  molecules  such 
as  water,  glucose,  and  ionic  salts  to  pass  through  (79). 

The  genesis  of  albuminuria  is  consistent  with 
generalized  vascular  dysfunction,  alterations  in  extra¬ 
cellular  matrix  components,  and  impaired  basement 
membrane  organization  resulting  from  hyperglycemia 
(80).  Increasing  consideration  has  been  given  to  the 
GFB  and  its  central  role  in  the  filtration  of  plasma 
proteins.  High-molecular-weight  tracer  molecules 
have  been  used  to  investigate  the  molecular  sieving 
properties  of  the  GFB.  Ferritin  has  an  estimated 
Stokes  radius  (SR)  of  61  A  and  when  injected  into 
healthy  rats  failed  to  pass  through  the  GBM  (81).  In 
puromycin  aminonucleoside-induced  nephrosis  mod¬ 
els,  which  induce  glomerular  injury  evidenced  by 
flattening  of  podocyte  foot  processes,  ferritin  is  only 
partially  retained  (82).  Smaller  proteins  have  also  been 
used  to  examine  the  selective  permeability  of  the 
GBM.  For  example,  horseradish  peroxidase  (SR  =  30 
A)  has  been  observed  to  pass  through  both  the  GBM 
and  slit  diaphragm,  while  myeloperoxidase  (SR  =  36 
A)  and  catalase  (SR  =  52  A)  are  GBM  permeable 
but  are  captured  by  the  slit  diaphragm  prior  to 
entering  the  urine,  consistent  with  the  hypothesis  that 
for  large  proteins,  GBM  selectivity  is  the  primary 
barrier  separating  the  blood  and  urine  space  (79). 

Genetic  polymorphisms  of  extracellular  matrix  pro¬ 
teins  as  well  as  enzymes  involved  in  their  metabolism 
have  been  suggested  as  possible  explanations  for  the 
variation  in  susceptibility  to  kidney  disease  observed  in 
T1D  patients  (80,  83).  Type  IV  collagens  are  a  principal 
component  of  basement  membranes  (84).  In  the  adult 
glomerulus,  collagens  encoded  by  the  COL4A3, 
COL4A4,  and  COL4A5  loci  replace  the  fetal  collagens 
COL4A1  and  COL4A2  (85).  Absence  of  the  adult 
forms  of  collagen  has  been  associated  with  proteinuria 
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because  of  Alport  syndrome,  a  genetically  determined 
disease  characterized  by  heavy  proteinuria.  In  these 
patients,  the  pathology  is  sustained  by  an  abnormal 
basement  membrane  because  of  defects  and/or  absence 
of  adult  type  IV  collagens  (86,  87). 

SNPs  associated  with  fetal  expressed  collagen 
COL4A1  have  also  been  linked  with  susceptibility  to 
nephropathy.  GBM  abnormalities  characterized  by 
irregularities  in  the  parietal  epithelium  lining  Bow¬ 
man’s  capsule  have  been  observed  in  heterozygous 
mice  containing  a  single  nucleotide  mutation  in 
COL4A1,  resulting  in  a  Gly627Trp  amino  acid 
substitution  in  exon  26  (83)  as  well  as  mice  harboring 
a  G  to  A  polymorphism  in  a  COL4A1 -encoded  splice 
site  resulting  in  loss  of  exon  40  from  the  mRNA 
transcript  (88).  Loss  of  COL4A1  exon  40  has  been 
linked  to  nearly  100%  prevalence  of  albuminuria  in 
animals  by  the  time  they  reach  2  yr  of  age  (89).  In 
humans,  mutations  affecting  GBM  structure  have 
been  linked  with  hereditary  risk  of  nephropathy  (89, 
90).  While  deletion  of  COL4A1  is  lethal,  mice  die  by 
embryonic  day  9.5  (83),  and  heterozygous  individuals 
may  survive  to  adulthood  carrying  increased  risk  of 
microvascular  complications  as  a  result  of  altered 
basement  membrane  structure  occurring  during  early 
development  and  organogenesis. 

The  third  layer  and  ultimate  barrier  for 
plasma  protein  ultrafiltration  is  the  slit 
diaphragm  of  the  podocytes 

Podocytes,  highly  specialized  epithelial  cells,  lie  on  the 
outer  surface  of  GBM  to  which  they  affix  by  cell 
surface  adhesion  proteins  such  as  the  alpha-3  beta-1 


integrin  and  dystroglycan  (91,  92).  The  podocyte  foot 
process  (polipoid  extraflections)  cover  the  outer  layer 
of  the  GBM.  The  intrapodocyte  connections  are 
typified  by  narrow  spaces  (30  40  nm)  covered  by 
a  porous  membrane  known  as  the  slit  diaphragm 
(Fig.  2).  The  slit-diaphragm  protein  complex  is  an 
adherens-type  junction  and  presents  a  dense  zipper¬ 
like  structure  composed  of  the  extracellular  compo¬ 
nents  of  nephrin,  neph  1 ,  P-cadherin  and  FAT  tumor 
suppressor  homolog  1  (FAT  1)  (93,  94).  Podocin  and 
CD2-associated  protein  (CD2AP)  are  localized  in  the 
intercellular  compartment  and  interact  with  the 
extracellular  components  of  the  slit  diaphragm 
anchoring  the  cytoplasm  domain  of  nephrin  to 
cholesterol-rich  regions  of  the  plasma  membrane  and 
to  the  actin  filament  structure  of  the  podocyte 
cytoskeleton  (79,  95).  The  protein  zona  occludens-1 
is  located  at  the  neph  1  insertion  site  of  the  slit 
diaphragm  interacting  with  the  cytoskeleton  as  well  as 
other  components  of  the  cell-cell  junction.  Recent 
findings  have  demonstrated  the  role  of  the  slit- 
diaphragm  adapter  protein  NCK  (96).  NCK  binds 
the  cytoplasmatic  tail  of  nephrin  and  following 
a  phosphorylation  signal  leads  to  cytoskeletal  reor¬ 
ganization.  This  finding  provides  evidence  that  the 
central  role  of  the  cytoplasmatic  region  of  the  nephrin/ 
NCK  complex  is  to  act  in  a  signaling  transduction 
capacity,  transmitting  antiapoptotic  signals  (via  Akt, 
a  serine-threonine  kinase)  (96),  promoting  remodeling 
of  the  cytoskeletal  structure  that  drive  formation  of 
the  actin-based  foot  processes  of  the  podocyte. 

The  key  role  of  the  slit-diaphragm  components  in 
maintaining  correct  permeability  function  of  the  GFB 
is  underscored  by  the  results  obtained  from  animal 
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Fig.  2.  Schematic  view  of  the  molecular  anatomy  of  two  podocyte  foot  processes  (yellow)  with  the  interposed  slit-diaphragm  complex. 
Interference  with  proteins  that  structurally  sustain  the  slit  diaphragm  (e.g.,  CD2AP,  nephrin,  and  podocin)  can  cause  foot  process  effacement 
resulting  in  proteinuria.  Filtrate  from  the  bloodstream  passes  through  the  glycocalyx-filled  fenestrae  located  between  endothelial  cells  lining 
the  GBM.  The  filtrate  then  flows  through  the  GBM  as  well  as  the  slit  diaphragm  and  into  the  Bowman's  space.  The  image  is  modified  and 
adapted  from  Vincenti  and  Ghiggeri  (151)  and  Johnstone  and  Holzman  (152).  CD2AP,  CD2-associated  protein;  FAT  1,  FAT  tumor 
suppressor  homolog  1;  GBM,  glomerular  basement  membrane,  NCK,  NCK  adaptor  protein  1;  ZO-1,  zona  occludens-1. 
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models  and  human  findings.  Mutation  in  genes 
encoding  nephrin,  podocin,  and  oc-actinin  proteins 
leads  to  severe  hereditary  forms  of  nephrotic  syn¬ 
drome  in  humans  (97,  98).  Mice  lacking  CD2AP  (99) 
develop  nephrotic  syndrome,  and  mutations  in 
CD2AP  were  found  in  two  human  patients  with  focal 
segmental  glomerulosclerosis.  NCK-deficent  animals 
fail  in  the  formation  of  podocyte  foot  processes  (96). 

Progress  in  elucidating  podocyte  biology  and 
signaling  have  shown  that  the  complex  of  proteins 
composing  the  slit  diaphragm  is  critical  to  maintaining 
podocyte  architecture  as  well  as  supporting  the  overall 
function  of  the  glomerular  filter  of  the  kidney.  For 
T1D,  as  well  as  T2D,  nephropathy,  there  are  reports 
of  a  reduced  number  and  density  of  the  podocytes 
and  an  increase  in  foot  process  width  (100-105). 
Steffes  et  al.  (102)  has  reported  loss  of  podocytes 
occurring  early  in  T1D,  while  other  authors  detected 
the  loss  of  podocytes  continuing  later  in  disease  pro¬ 
gression  (104).  The  loss  of  podocytes  from  the  glo¬ 
merular  space  has  been  correlated  with  increased 
albumin  excretion. 

A  loss  of  nephrin  along  with  the  onset  of  micro¬ 
albuminuria  occurs  in  early  stages  of  DN  in  T1D 
patients  (106).  The  loss  of  nephrin  was  detected  even 
before  the  appearance  of  microalbuminuria,  suggest¬ 
ing  that  damage  to  the  slit  diaphragm  may  occur  early 
in  disease  pathogenesis.  Other  studies  confirmed 
reduction  of  nephrin  and  other  slit-diaphragm  pro¬ 
teins  in  the  glomeruli  of  patients  with  both  T1D  and 
T2D  nephropathies  (107-109).  Koop  et  al.  (108) 
reported  on  DN  in  six  patients,  observing  statistically 
significant  decrease  in  protein  level  of  nephrin  along 
with  reduced  expression  of  podocin  and  podocalyxin. 
Nephrin  mRNA  expression  inversely  correlated  with 
increasing  severity  of  proteinuria  (108). 


The  tubular  system 

T1DN  is  often  considered  a  glomerular  disease; 
however,  progression  to  end-stage  organ  failure  also 
correlates  with  increased  level  of  renal  tubulointer¬ 
stitial  fibrosis  (110,  111).  Changes  occurring  in  the 
renal  tubular  system  have  been  implicated  as  a  primary 
contributor  to  the  process  of  renal  cortex  fibrosis 
observed  in  DN  (112).  Increased  fibrosis  correlates 
with  macroalbuminuria,  nephron  tubular  cell  injury, 
induction  of  apoptosis,  and  hypertension  (110,  111, 
113-117). 

Renal  tubular  system  epithelial  cells  and  interstitial 
tissue  undergo  changes  during  progression  of  T1DN. 
These  include  tubulointerstitium  alterations,  tubular 
hypertrophy,  and  basement  membrane  alterations 
that  precede  tubulointerstitial  fibrosis.  Tubular  cell 
hypertrophy  is  an  early  feature  of  T1DN.  Exposure  of 
proximal  tubular  epithelial  cells  to  hyperglycemia 


occurs  as  a  result  of  glycosuria  as  well  as  from  high 
glucose  concentration  present  in  interstitial  tissue. 
Cellular  response  to  elevated  glucose  includes  increased 
tubular  basement  membrane  thickening  as  a  result  of 
increased  synthesis  of  type  IV  collagen  and  fibronectin 
as  well  as  altered  matrix  protein  metabolism  as  a  result 
of  increased  expression  of  metalloproteinase  inhibitors 
tissue  inhibitor  of  metalloproteinase  (TIMP)-l  and 
TIMP-2  (114,  115,  118).  Changes  to  the  tubulointer¬ 
stitium  co-occur  along  with  changes  to  the  glomeru¬ 
lus.  Poor  filtration  of  the  primary  urine  may  lead  to 
presentation  of  cytotoxic  compounds  to  the  nephron 
tubular  cells.  Combining  the  deleterious  affects  of 
glycosuria  with  exposure  to  cytokines  may  stimulate 
cellular  responses  leading  to  organ  failure. 

Primary  filtration  of  urine  occurs  at  the  slit 
diaphragm  of  the  GFB  (45);  however,  glucose  and 
sodium  ions  are  freely  filtered.  Glucose  is  entirely 
reabsorbed  in  the  proximal  tubule  by  a  sodium- 
glucose  cotransport  system.  Recovery  of  greater  than 
99%  of  filtered  sodium  ion  occurs  within  the  nephron 
via  the  action  of  specialized  epithelial  cells  lining  the 
renal  tubular  system  proximal  tubule  (responsible  for 
50%  sodium  absorption),  thick  ascending  loop  of 
Henle  (30  40%  sodium  absorption),  distal  convoluted 
tubule  (5%  sodium  absorption),  and  the  collection 
duct  (4%  sodium  absorption)  (Fig.  3).  Sodium  reup¬ 
take  by  the  distal  convoluted  tubule  and  collection 
duct  account  of  a  small  percentage  of  total  salt 
reabsorption  but  are  the  principal  sites  at  which  net 
salt  balance  is  determined  (119).  A  widely  held  model 
for  explaining  the  role  of  the  kidney  in  blood  pressure 


Glomerulus 


Loop  of  Henle  ▼ 

Urine  Excretion  Na+<1% 

Fig.  3.  A  schematic  view  of  sodium  reabsortion  along  the  nephron. 
Urine  is  processed  via  the  PCT,  TAL,  and  DCT.  The  juxtaglo¬ 
merular  apparatus  comprises  the  macula  densa  cells,  which 
represent  specialized  tubular  cells  at  the  end  of  the  thick  ascending 
limb,  comprising  cells  from  the  extraglomerular  mesangium, 
vascular  smooth  muscle  cells,  and  renin-secreting  cells  in  the  media 
of  the  afferent  glomerular  arteriole.  DCT,  distal  convoluting  tubule; 
PCT,  proximal  convoluting  tubule;  TAL,  thick  ascending  limb. 
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control  is  through  its  role  balancing  sodium  excretion 
and  recovery.  Hypertension  results  from  impaired 
blood  pressure  control  and  correlates  with  changes  in 
the  kidney  that  lead  to  impaired  renal  function,  i.e., 
inability  to  maintain  sodium  ion  homeostasis. 

Increased  kidney  mass  is  an  early  feature  of  DN 
(120)  with  expansion  of  the  proximal  tubular  cells 
accounting  for  most  of  the  growth  (121,  122).  Along 
with  tubular  cell  expansion  there  is  increased  sodium 
recovery.  Vallon  et  al.  (123)  have  suggested  a  model  in 
which  increased  sodium  recovery  by  the  proximal 
convoluted  tubule  causes  glomerular  hyperfiltration, 
leading  to  nephropathy.  The  primary  increase  in 
proximal  tubular  reabsorption  in  early  diabetes  may 
be  because  of  proximal  tubular  reabsorption  through 
enhanced  sodium-glucose  cotransport.  Studies  on 
patients  with  T1DN  have  indicated  increased  sodium 
recovery  occurring  primarily  within  nephron  tubular 
segments  upstream  from  the  macula  densa  in  the 
distal  convoluted  tubule  (124,  125)  and  have  indicated 
that  there  is  increased  tubular  recovery  of  sodium  ions 
occurring  in  patients  with  early-stage  T1D. 

The  juxtaglomerular  apparatus  comprises  different 
structures  in  functional  and  structural  link:  cells  of  the 
extraglomerular  mesangium,  which  fill  the  angle 
between  the  afferent  and  the  efferent  glomerular 
arteriole;  vascular  smooth  muscle  cells  and  renin- 
secreting  cells  in  the  media  of  the  afferent  glomerular 
arteriole;  and  the  macula  densa.  The  macula  densa  is 
an  area  of  specialized  cells  lining  the  region  of  the 
distal  convoluted  tubule  next  to  the  glomerular 
vascular  pole.  These  cells  are  sensitive  to  the  ionic 
content  and  water  volume  of  the  fluid  in  the  distal 
convoluting  tubule.  An  increase  or  decrease  in 
sodium,  chloride,  and  potassium  uptake  elicits  inverse 
changes  in  the  single-nephron  GFR  by  altering  the 
vascular  tone,  predominantly  of  the  afferent  arteriole. 
The  mechanism  serves  to  establish  an  appropriate 
balance  between  GFR  and  tubular  reabsorption 
upstream  from  the  macula  densa.  As  the  proximal 
tubule  grows,  more  of  the  glomerular  filtrate  sodium 
is  reabsorbed  and  less  reaches  the  macula  densa  at  the 
end  of  Henle’s  loop  causing  GFR  to  increase  through 
the  normal  physiologic  actions. 

The  link  between  increased  tubular  sodium  recovery 
and  diabetic  kidney  disease  is  echoed  in  observations 
obtained  from  studies  involving  hyperfiltering  T1D 
patients  in  which  there  is  reported  a  positive  correla¬ 
tion  between  GFR  and  sodium  recovery  in  the 
proximal  tubular  region  of  the  nephron  (125). 
Examination  of  T1D  patients  have  indicated  that 
hyperfiltration  by  the  nephron  may  continue  even 
once  blood  glucose  levels  are  controlled  by  intensive 
insulin  therapy  (126-128)  and  may  continue  to 
progress  independent  of  glucose  control  as  a  result 
of  persisting  enlargement  of  the  nephron  tubular 
compartment. 


Molecular  pathways 

Pathways  leading  to  fibrogenesis 

The  interplay  between  glomerular  hypertension  and 
hyperglycemia  may  be  linked  to  the  pathological 
changes  seen  in  T1DN.  In  non-diabetic  individuals, 
glomeruli  are  protected  from  deleterious  effects 
associated  with  essential  hypertension  by  interacting 
autoregulatory  mechanisms  (129).  Increased  blood 
pressure  is  associated  with  increased  preglomerular 
resistance  thus  isolating  glomerular  capillaries  from 
potential  damage.  However,  the  ability  to  regulate 
glomerular  blood  flow  is  impaired  when  significant 
ablation  of  renal  mass  occurs,  resulting  in  increased 
single-nephron  GFR.  Carmines  et  al.  (130)  demon¬ 
strated  that  regulatory  mechanisms  are  damaged  by 
hyperglycemia-induced  inhibition  of  voltage-gated 
calcium  channels.  Endothelial  cell  injury  as  a  result 
of  hypertensive  glomerular  damage  may  be  responsi¬ 
ble  for  initiating  glomerulosclerosis,  with  the  process 
involving  increased  expression  of  transforming  growth 
factor  (TGF)-beta  and  extracellular  matrix  proteins 
(131).  The  accumulation  of  excess  extracellular  matrix 
within  the  glomerulus  and  the  interstitium  is  associ¬ 
ated,  at  least  in  part,  with  the  profibrotic  cytokine, 
TGF-|3  (132-134). 

The  observation  that  conditions  characterized  by 
a  severe  but  relatively  selective  albuminuria,  such  as 
minimal  change  nephropathy,  do  not  initiate  intersti¬ 
tial  fibrosis  led  to  the  study  of  the  role  of  higher 
molecular  weight  ultraflltrated  proteins  in  DN.  Ultra- 
filtrated  growth  factors,  such  as  insulin-like  growth 
factor  1  (IGF-1),  hepatocyte  growth  factor  (HGF), 
and  TGF-(S.  may  have  a  direct  effect  on  the  nephron 
tubular  epithelial  cells  by  inducing  transcription  of 
genes  with  profibrogenic  effects,  whereas  other 
cytokines  such  as  bone  morphogenic  protein  7 
(BMP-7)  act  as  inhibitors  of  these  pathways  (135) 
(Fig.  4).  The  importance  of  HGF/IGF-1  action  is 
underscored  by  the  demonstration  of  a  direct  effect  of 
these  hormones  on  renal  hypertrophy,  microalbumi¬ 
nuria,  and  glomerulosclerosis  (136)  and  that  increased 
excretion  rate  of  both  urinary  IGF-1  and  urinary 
HGF  strongly  correlate  with  microalbuminuria  and 
kidney  volume  in  T1D  patients  (137). 

Pathophysiological  mechanisms  leading  to  sclerosis 
may  be  related  to  simultaneous  exposure  to  proscler- 
otic  cytokines  in  glomeruli  and  tubulointerstitium  as 
well  as  tubulotoxicity  caused  by  protein  contents  in 
filtrate  overloading  the  proximal  tubule’s  absorption 
ability.  Proteinuria  is  proposed  to  be  more  than 
a  highly  reliable  predictive  factor  having  a  causal  role 
during  disease  progression  (138).  Ultraflltrated  pro¬ 
teins  undergo  endocytosis  by  the  proximal  tubular 
cells,  resulting  in  a  cytotoxic  effect  directly  leading  to 
tubular  damage.  The  interaction  of  filtrated  protein 
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Glomerular  filtration  barrier 


Fig.  4.  Interstitial  fibrogenesis.  When  glomerular  proteinuria  oc¬ 
curs,  excess  filtrated  proteins  undergo  endocytosis  by  the  PTC  (in 
blue),  resulting  in  cytotoxic  effects  and  tubular  damage  (red  PTC). 
Ultrafiltrated  growth  factor  cytokines  (IGF-1,  TGF-fS,  and  HGF) 
activate  a  number  of  genes,  enabling  PTC  to  directly  contribute  to 
interstitial  fibrosis  by  producing  ECM  proteins.  Tubular  cell 
interaction  with  growth  factor  cytokines  elicited  production  of 
other  cytokine  signals  (PDGF  and  CTGF)  able  to  induce  ECM 
production  by  interstitial  cells.  Other  signals  such  as  BMP-7  act  as 
inhibitors  of  these  pathways.  BMP-7,  bone  morphogenic  protein  7; 
CTGF,  connective  tissue  growth  factor;  ECM,  extracellular  matrix; 
FIGF,  hepatocyte  growth  factor;  IGF-1,  insulin-like  growth  factor 
1;  PDGF,  platelet-derived  growth  factor;  PTC,  proximal  tubular 
cells;  TGF-P,  tissue  growth  factor-beta. 

with  proximal  tubular  cells  could  be  one  of  the 
initiating  signals  enabling  a  cascade  of  events  leading 
to  changes  in  extracellular  matrix  composition  and 
eventually  to  interstitial  fibrogenesis. 

Candidate  genes  and  their  pathways 

Studies  of  candidate  genes  for  T1DN  have  focused  on 
genes  controlling  the  balance  between  extracellular 
matrix  protein  synthesis  and  degradation,  glucose 
metabolism,  as  well  as  those  whose  products  control 
the  interaction  between  kidney  function  and  blood 
pressure  control.  Genes  influencing  the  activity  of 
growth  factors,  protein  kinase  C  activation,  hormones 
regulating  blood  pressure,  accumulation  of  advanced 
glycosylation  end  products,  aldose  reductase  pathway 
flux,  and  altered  glucose  transport  have  each  been 
extensively  evaluated.  The  roles  of  hypertension 
candidate  genes  have  also  been  studied,  although 
most  of  the  work  has  focused  on  the  renin-angioten¬ 
sin  system  (RAS). 

Candidate  gene  studies  have  evaluated  both  func¬ 
tional  (i.e.,  nonsynonymous  codons)  and  allelic  mark¬ 
ers  linked  statistically  for  their  influence  on  T1DN. 
However,  this  approach  has  as  yet  to  convincingly 
identify  risk  factors  when  replicated  in  independent 
cohorts  (139).  The  failure  to  replicate  first-stage  re¬ 
sults  from  carefully  controlled  genetic  studies  possibly 
reflects  the  confounding  influence  of  environmental 


factors  on  statistical  power  to  detect  a  true  positive 
signal.  Strategies  for  the  selection  of  candidate  genes 
are  improved  by  consideration  of  linkage  scans  along 
with  relevant  biologic  information  about  the  impli¬ 
cated  loci.  For  example,  chromosome  3q  has  been 
identified  in  multiple  linkage  studies  for  T1DN  (48, 
140).  This  region  of  the  genome  has  also  been 
identified  in  studies  of  T2DN  in  Pima  Indians  (141) 
and  in  African-American  families  (142).  Other  chro¬ 
mosomal  regions  have  also  been  identified  by  linkage 
analysis  to  T1DN,  but  chromosome  3q  is  strongly 
implicated  in  hypertension  (48)  and  hypertension  is 
frequently  found  to  co-occur  with  T1DN.  A  number 
of  candidate  genes  are  contained  within  the  region  of 
chromosome  3q  implicated  in  these  studies  (Tables  3 
and  4).  However,  one  of  the  possible  candidates,  the 
succinate  receptor  (SUCNR1),  has  been  linked  with 
carbohydrate  catabolism  and  with  control  of  blood 
pressure.  He  et  al.  (143)  reported  that  the  orphan 
G-protein-coupled  receptor  encoded  by  the  SUCNR1 
locus  detected  succinate,  which  is  a  metabolic  inter¬ 
mediate  produced  by  the  tricarboxylic  acid  (TCA) 
cycle  during  respiration,  and  increased  succinate  levels 
were  causal  for  increased  blood  pressure  in  mice.  In 
diabetic  patients,  it  is  possible  that  local  mismatch 
of  energy  supply  and  demand,  altered  metabolism 
of  TCA-cycle  intermediates,  or  injury  may  lead  to 
mitochondrial  dysfunction  and  the  release  of  succinate 
into  circulation.  Succinate  molecules  activate  recep¬ 
tors  in  the  kidney,  causing  the  release  of  renin  and 
activation  of  the  RAS.  The  RAS  leads  to  an  increase 
in  blood  pressure  and  altered  local  blood  flow.  In 
healthy  individuals,  this  system  might  act  to  regulate 
local  blood  flow  to  match  metabolic  demands. 
However,  in  diabetic  patients  with  impaired  metabolic 
control,  it  might  also  result  in  hypertension  or  altered 
cellular  function  and  may  contribute  to  renin-medi¬ 
ated  constriction  of  the  renal  artery  (144). 

Many  other  candidate  genes  have  been  examined 
during  genetic  evaluation  of  T1DN.  For  example, 
Ewens  et  al.  (90)  investigated  110  candidate  genes  by 
transmission/disequilibrium  testing  (TDT)  (145)  using 
72  family  trios  of  which  the  offspring  experienced 
T1DN.  Genes  were  chosen  based  on  the  results  of 
previous  studies  associating  various  regions  of  the 
human  genome  to  the  phenotype  of  T1DN  and  have 
implicated  proteins  expressed  within  the  GFB.  Extra¬ 
cellular  matrix  proteins  encoded  by  the  loci  COL4A1, 
LAMA4,  and  LAMC1  comprise  important  compo¬ 
nents  of  GBM  forming  the  blood-urine  barrier  in  the 
kidney.  Genetic  variants  linked  to  different  alleles  of 
the  COL4A1  locus  have  been  studied  by  TDT  sta¬ 
tistical  analysis  supporting  the  hypothesis  for  genetic 
linkage  of  COL4A1  (p  value  =  0.0002)  to  the  ne¬ 
phropathy  phenotype  along  with  additional  linkage  to 
the  laminin-encoding  loci,  LAMA4  (p  value,  0.016) 
and  LAMC1  (p  value,  0.026)  (90). 
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Concluding  remarks 

The  goal  of  reducing  the  incidence  of  T1D  complica¬ 
tions  has  been  intensively  investigated.  T1DN  is  the 
principal  etiology  leading  to  ESRD.  The  molecular 
mechanism! s)  underlying  the  disease  involve  an  inter¬ 
play  between  genes  and  gene-environmental  exposures. 
Improved  understanding  of  the  disease  pathogenesis 
will  lead  to  better  identification  of  those  at  risk  and  by 
doing  so  lead  to  improved  treatment  regimens. 

The  evidence  for  a  dominant  genetic  role  in 
determining  susceptibility  to  kidney  disease  in  T1D 
patients  is  primarily  the  result  of  epidemiological 
studies  indicating  that  prevalence  of  DN  increased 
during  the  first  15  yr  after  onset  of  T1D.  After  20  yr 
duration  of  diabetes,  the  incidence  of  new  cases  of 
nephropathy  among  T1D  patients  plateaus  and  in  fact 
may  decrease  (3,  19,  20).  These  observations  have  fre¬ 
quently  been  interpreted  as  indicating  that  there  exists 
a  subset  of  patients  susceptible  to  develop  kidney 
disease.  Additional  evidence  for  genetic  risk  has  been 
obtained  from  family  studies,  showing  the  clustering 
of  DN  among  T1D  siblings  (21-24).  Siblings  experi¬ 
encing  T1D  have  a  significantly  increased  risk  for  DN 
when  the  T1D  proband  experiences  the  disease. 
Interpretation  of  the  data  generated  from  genetic 
analysis  of  T1DN  is  complicated  by  the  possibility 
that  signals  are  related  to  coincident  diseases  (e.g., 
hypertension)  as  well  as  environmental  exposures  such 
as  smoking.  In  order  to  interpret  these  results,  it  is 
necessary  to  compare  the  results  of  genetic  testing  of 
control  populations  as  well  as  replication  of  the  results 
in  independently  recruited  case  cohorts. 

Replication  of  genome-wide  linkage  and  candidate 
gene  studies  for  T1DN  have  been  limited  by  lack  of 
adequate  sample  size.  Recruitment  of  large  indepen¬ 
dent  cohorts  for  T1DN  genetic  analysis  is  required  to 
attain  statistical  power  to  detect  true  genetic  signals. 
Several  initiatives  are  ongoing  to  address  this  issue. 
The  most  recent  of  these  studies  is  the  genetics  of 
kidneys  in  diabetes  (GoKinD),  which  has  provided 
a  collection  of  DNA  samples  and  relevant  clinical 
information  from  greater  than  3000  study  participants 
(146).  A  genome -wide  association  study  of  the 
GoKinD  cohort  has  recently  been  funded  as  part  of 
the  Genetic  Association  Information  Network’s  col¬ 
laborative  effort  to  evaluate  genes  in  complex  diseases 
(http://www.fnih.org/GAIN/GAIN_home.shtml).  The 
initial  results  of  a  500  000  SNP  genome-wide  associ¬ 
ation  scan  based  on  the  GoKinD  cohort  is  due  in  the 
fall  of  2007.  Collection  of  additional  cohorts  is 
ongoing  at  a  number  of  institutions  investigating 
T1DN  genetic  risk.  Careful  collection  practices  cen¬ 
tered  on  recording  relevant  health  history  of  partic¬ 
ipants  are  essential  for  success.  Knowledge  of  patient 
environmental  exposures  (e.g.,  history  of  hypertension, 
use  of  anti-hypertensive  medications,  and  smoking)  are 


as  essential  as  recording  T1D  history  and  will  greatly 
aid  in  stratification  of  populations  when  comparing 
participants  recruited  at  different  centers. 
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On  the  Use  of  General  Control  Samples 
for  Genome-wide  Association  Studies: 

Genetic  Matching  Highlights  Causal  Variants 
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Resources  being  amassed  for  genome-wide  association  (GWA)  studies  include  "control  databases"  genotyped  with  a  large-scale  SNP  array. 
How  to  use  these  databases  effectively  is  an  open  question.  We  develop  a  method  to  match,  by  genetic  ancestry,  controls  to  affected 
individuals  (cases).  The  impact  of  this  method,  especially  for  heterogeneous  human  populations,  is  to  reduce  the  false-positive  rate, 
inflate  other  spuriously  small  p  values,  and  have  little  impact  on  the  p  values  associated  with  true  positive  loci.  Thus,  it  highlights 
true  positives  by  downplaying  false  positives.  We  perform  a  GWA  by  matching  Americans  with  type  1  diabetes  (T1D)  to  controls 
from  Germany.  Despite  the  complex  study  design,  these  analyses  identify  numerous  loci  known  to  confer  risk  for  T1D. 


Introduction 

Systematic  GWA  studies  are  critically  dependent  on  the 
availability  of  very  large  and  well-characterized  control 
populations.  With  a  different  degree  of  structure  in  mod¬ 
ern  populations,  ideally,  multiple,  diverse,  and  large  con¬ 
trol  populations  will  be  used.  As  platforms  for  GWA  be¬ 
come  standardized,  numerous  sources  of  pregenotyped 
control  individuals  are  becoming  available.  Typically, 
many  more  controls  are  available  than  cases,  and  we  be¬ 
lieve  these  controls  can  be  advantageous  for  discovering 
risk  loci  and  for  controlling  the  false-positive  rate.  For  ex¬ 
ample,  the  data  analyzed  here  include  416  Americans  of 
European  descent  diagnosed  with  T1D  (MIM  222100) 
and  a  control  database  of  2159  individuals  from  different 
regions  of  Germany. 

Ancestry  matching  based  on  nongenetic  variables1  and 
SNP  genotypes2  for  genetic-association  studies  has  been 
proposed  previously.  Our  approach,  which  we  call  genetic 
matching  or  GEM,  goes  further  in  that  we  show  how  to 
systematically  obtain  favorable  matching  by  using  a  panel 
of  genetic  markers  and  how  to  determine  outlying  individ¬ 
uals  as  well  as  individuals  that  cannot  be  successfully 
matched  to  others  in  the  available  registry.  By  simulations, 
we  will  contrast  matching  to  a  commonly  used  method  for 
controlling  the  confounding  of  ancestry,  namely  the  use 
of  eigenvector  analysis3  via  Eigenstrat4  to  identify  predic¬ 
tors  of  ancestry;  for  the  real  data,  we  contrast  matching 
to  both  Eigenstrat  and  identification  of  common  ancestry, 
such  as  European  American. 


We  propose  matching  on  the  basis  of  genetic  similarities 
derived  from  eigenvector  decomposition  (EVD),  making 
our  initial  analyses  similar  to  that  taken  in  Eigenstrat.4 
The  best  known  form  of  matching  is  matched  pairs 
(pMatch);  however,  assuming  the  criterion  for  matching 
are  sufficient  to  remove  the  effects  of  unmeasured  con¬ 
founding,  an  alternative  to  matched  pairs  known  as  full 
matching  (fMatch)  is  optimal.5  Consider  a  scenario  in 
which  three  cases  (a,  b,  and  c)  and  three  controls  (x,  y, 
and  z)  fall  into  two  distinct  ancestral  clusters  (a,  x,  and  y) 
and  (b,  c,  and  z).  Matching  pairs  creates  three  strata, 
(a  and  x),  (c  and  z),  and  (b  and  y),  but  the  pair  (b  and  y) 
does  not  define  a  homogeneous  strata.  Alternatively, 
fMatch  minimizes  the  total  distance  between  individuals 
within  strata  with  the  constraint  being  that  each  stratum 
includes  a  single  case  and  one  or  more  controls,  or  vice 
versa,  i.e.,  clusters  (a,  x,  and  y)  and  (b,  c,  and  z).  Of  the 
two,  fMatch  is  optimal  because  case  and  control  samples 
are  unlikely  to  have  identical  distributions  of  ancestry, 
and  in  this  situation,  forcing  each  case  to  match  a  unique 
control  leads  to  suboptimal  matches.  (pMatch  can  be  very 
useful,  however,  in  designing  follow-up  studies  that  require 
preselection  of  case  and  control  samples.) 

In  large  association  studies,  the  sample  typically  includes 
some  individuals  with  widely  varying  ancestry.  EVD  is 
highly  sensitive  to  outlying  observations.  A  few  points 
lying  far  from  the  majority  of  the  data  can  determine  mul¬ 
tiple  principal  axes  of  the  representation.  Indeed,  outliers 
can  obscure  the  discovery  of  axes  that  potentially  separate 
the  data  into  distinct  types.  For  this  reason,  individuals 
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Figure  1.  Flowchart  for  Genetic-Match¬ 
ing  Algorithm  Illustrated  with  Portions 
of  the  T1D  Data 

Distances  between  individuals  are  deter¬ 
mined  by  the  major  axes  of  variation  in 
the  EVD  representation.  Outlier  removal, 
illustrated  by  (A),  is  critical  for  revealing 
the  subtle  variability  between  individuals 
of  similar  ancestry.  After  major  outliers 
are  removed,  clustering  is  used  for  discov¬ 
ery  of  homogeneous  clusters;  four  distinct 
clusters  are  displayed  here  (B),  pLotted  as 
principal  component  axes.  Two  of  these 
clusters  are  displayed  before  ( [C] ,  left) 
and  after  ([C],  right)  rescaling  of  axes. 
Some  observations  are  not  outliers,  but 
they  are  unmatchabLe  ([D],  left);  for  exam¬ 
ple,  the  isolated  case  in  the  center  of  the 
plot.  Rescaled  distances  are  compared  to 
distances  expected  in  homogeneous  sam¬ 
ples  ([D],  right)  to  identify  cases  and  con¬ 
trols  that  can  not  be  successfully  matched. 
Association  analysis  is  performed  on  match¬ 
ed  strata  so  that  the  effects  of  population 
structure  couLd  be  removed  (not  shown). 


Eigenvector! 

having  highly  unusual  measures  on  any  of  the  major  eigen¬ 
vectors  are  removed.4  Likewise,  with  matching  it  is  necessary 
to  determine  which  strata  span  an  unusual  distance  leading 
to  "unmatchable  individuals."  If  the  controls  are  more  nu¬ 
merous  than  the  cases,  they  typically  span  a  larger  range  of 
ancestries  than  cases,  and  it  should  be  possible  to  find  one 
or  more  controls  similar  to  each  case.  Conversely,  some  cases 
may  have  to  be  removed  to  account  for  the  effects  of  struc¬ 
ture.  In  this  work,  we  formalize  the  notions  of  outlying 
and  unmatchable  individuals  and  propose  a  method  to  dis¬ 
cover  the  key  axes  that  describe  the  population  structure. 

Material  and  Methods 

A  Sketch  of  the  Matching  Procedure  Employed 
by  GEM  and  Displayed  in  Figure  1 

The  illustration  (Figure  1)  shows  the  steps  involved  in  matching 
genotyped  cases  and  controls.  To  begin,  create  an  L  SNPs  and  N 
individuals  matrix  of  scaled  allele  counts  from  which  the  EVD 
is  computed  (see  Appendix).  The  top  D  eigenvectors  form  a 


D  dimensional  map  describing  the  "ances¬ 
try"  of  each  individual,  i.e.,  the  mapping 
of  the  ith  subject  in  each  dimension  is  de¬ 
termined  by  the  ith  element  the  in  rith  ei¬ 
genvector.  The  dth  eigenvalue  determines 
the  importance  of  the  rfth  dimension  in 
the  new  representation  of  the  data.  Indi¬ 
viduals  of  similar  ancestry  map  to  similar 
values  in  the  eigenvectors  associated  with 
large  eigenvalues  (Figure  1).  Eigenvectors 
associated  with  small  eigenvalues  have  lit¬ 
tle  or  no  genetic  interpretation. 

For  a  homogeneous  population,  the  largest  eigenvalues  provides 
the  basis  for  a  significance  test  for  population  structure  (see  Patter¬ 
son6  and  Appendix).  Applying  this  test  with  significance  level  a  = 
0.01,  we  determine  the  number  of  dimensions  D  to  be  used  in  the 
eigenvector  representation.  The  EVD  determines  the  distance  be¬ 
tween  individuals  on  the  basis  of  the  top  D  eigenvectors,  serving 
as  coordinates  or  dimensions,  and  eigenvalues  serving  as  weights 
to  exaggerate  differences  in  dimensions  of  greater  importance 
(see  Appendix). 

If  the  data  have  many  outliers,  D  will  be  relatively  large,  and 
the  principal  eigenvectors  will  be  poorly  estimated.7  Outliers  (Fig¬ 
ure  1A)  can  be  removed  with  visual  diagnostics  or  the  criterion 
from  Eigenstrat;4  namely,  remove  any  individuals  with  ancestry 
coefficients  greater  than  6  SDs  in  at  least  one  of  the  D  eigenvector 
axes.  After  removing  outliers,  the  EVD  should  be  recomputed.  If 
the  estimated  dimension,  D,  is  still  greater  than  two  or  three,  we 
suggest  finding  the  distance  between  nearest  pairs  of  controls 
and  cases.  A  subject  with  ancestry  that  does  not  lend  itself  to 
matching  will  appear  as  an  outlier  via  this  criterion  and  should 
be  removed  (see  the  T1D  example). 

To  determine  how  to  match  and  which  individuals  are  unmatch¬ 
able,  we  rely  on  the  distribution  of  distances  between  individuals 
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in  a  homogeneous  population.  For  a  homogeneous  sample,  the 
distribution  of  distances  will  depend  on  sample  size  N  and  the 
number  of  locii.  By  using  simulations,  we  can  find  the  distribution 
of  distances  for  a  homogeneous  population.  These  simulations  also 
yield  the  distribution  of  eigenvalues  for  a  homogeneous  sample  of 
size  N. 

Real  populations  are  heterogeneous  but  can  be  modeled  as 
mixtures  of  relatively  homogeneous  subpopulations  (Figure  IB). 
We  wish  to  represent  these  subpopulations  so  that  the  between- 
subject  distances  within  a  homogeneous  subpopulation  are  compa¬ 
rable  to  expectation  if  the  entire  sample  were  homogeneous.  To  do 
so,  we  need  to  model  the  underlying  population  substructure  and 
adjust  real  data  so  that  they  are  scaled  properly  (Figure  1C);  other¬ 
wise,  the  between  subpopulation  variance  will  cause  distances  be¬ 
tween  individuals  to  be  poorly  calibrated  (Figure  1C).  We  do  this 
via  a  two-stage  algorithm  involving  clustering  and  scaling.  In  stage 
one,  we  cluster  individuals  that  appear  to  have  common  ancestry. 
This  is  done  iteratively,  by  addition  of  clusters  and  then  testing 
for  structure  (see  Appendix  for  testing)  until  each  cluster  is  homo¬ 
geneous.  We  use  Ward's  algorithm8,9  to  form  hierarchical  groups 
of  mutually  exclusive  subsets  based  on  the  first  D  axes  of  the 
EVD.  We  need  a  stopping  rule  for  choosing  K,  the  number  of  clus¬ 
ters.  Start  with  K  =  2  and  apply  the  test  for  population  structure 
on  each  of  the  clusters  (a  =  0.001).  Homogeneous  clusters,  as 
judged  by  the  significance  test,  are  set  aside,  and  Ward's  algorithm 
is  applied  only  to  the  remaining  data.  Repeat  this  process,  increas¬ 
ing  K  until  all  the  clusters  are  homogeneous  or  consisting  of  too  few 
observations  (~20).  Finally,  we  rescale  interindividual  distances  as 
described  in  the  Appendix  so  that  they  are  comparable  to  distances 
found  in  a  homogeneous  population.  At  this  rescaling  step,  un- 
matchable  individuals  are  uncovered  and  removed  (Figure  ID). 

After  outliers  and  unmatchable  individuals  are  removed  from 
the  sample,  recalculate  the  EVD  and  determine  D.  Reverting 
back  to  unsealed  eigenvectors,  find  the  distance  between  cases 
and  controls  on  the  basis  of  the  Euclidean  distance  with  D  dimen¬ 
sions  as  described  in  the  Appendix.  Match  strata  with  either  full 
match  or  pair  match.  Software  implementing  matching  algo¬ 
rithms  is  widely  available  (e.g.,  we  use  the  optmatch  function  in 
the  statistical  package  R).  Then,  the  data  can  be  analyzed  for  dis¬ 
ease  and  SNP  association  by  conditional  logistic  regression.  Other 
covariates  can  be  entered  into  the  model  at  this  point. 

T1D  Analyses 

Purified  samples  of  genomic  DNA  were  obtained  from  the  Genetics 
of  Kidneys  in  Diabetes  (GoKinD)  study10  and  from  T1D  patients 
recruited  at  the  Children's  Hospital  of  Pittsburgh  (CHP)  and  Uni¬ 
versity  of  Pittsburgh  Medical  Center.  The  study  employed  a  human 
gene-chip  microarray  (Affymetrix,  Santa  Clara,  CA)  for  evaluation 
of  genetic  variants  with  DNA  samples  from  T1D  (case)  participants 
with  genetic  typing  data  obtained  from  the  KORA11  and  PopGen12 
"control"  cohorts.18  Genotyping  results  were  obtained  with  the 
same  Affymetrix  500K  SNP  typing  array;  however,  assays  for  case 
and  control  cohorts  were  performed  independently.  Case  partici¬ 
pants  (n  =  416)  were  recruited  in  the  U.S.,  with  self-declared  Euro¬ 
pean  ancestry  and  T1D;  control  participants  (n  =  2159)  were 
citizens  of  Germany  recruited  independent  of  phenotype  (Table  1). 
Recruitment  of  participants  at  CHP  was  governed  by  the  human 
subjects  protocol  approved  by  the  University  of  Pittsburgh  Institu¬ 
tional  Review  Board  (IRB  #011052:  New  Advanced  Technology 
to  Improve  Prediction  of  Type  1  Diabetes).  CHP  patients  (n  =  28) 
consented  to  providing  10  ml  blood  obtained  by  vein  puncture  as 
well  as  a  brief  medical  history  relating  to  onset  of  T1D.  The  GoKinD 


Table  1.  Characteristics  of  Case  and  Control  Participants 

Case  Participants 

Control  Participants 

Demographic 

Characteristics 

CHP 

GoKinD 

KORA 

POPGEN 

Number  of  singletons 

28 

394 

1644 

500 

Nominal  European 
American  (%) 

100% 

100% 

— 

— 

German  residents  (%) 

— 

— 

100% 

100% 

Male  gender  (%) 

50% 

46.7  % 

49.5% 

51.8% 

H i sto ry  of  Diabetes 

Type  1  diabetes  (%) 

100% 

100% 

— 

— 

Mean  age  at  T1D 
Diagnosis  (yr) 

12.7  ± 

7.9  12.2  ±  7.1 

cohort  (n  =  394)  was  recruited  independently  from  the  CHP  cohort 
by  collaborative  efforts  of  the  Juvenile  Diabetes  Research  Founda¬ 
tion,  National  Institutes  of  Health,  and  U.S.  Center  of  Disease  Con¬ 
trol.10  Material  from  the  GoKinD  cohort  was  provided  as  solutions 
of  DNA,  purified  from  lymphoblastoid  cell  lines  or  from  whole 
blood.  DNA  solutions  were  provided  as  50  pi  aliquots  containing 
-100  ng/pl  DNA  per  aliquot  dissolved  in  20  mM  NaCl  and  1  mM 
EDTA  (pH  7.5).  DNA  from  the  CHP  samples  were  obtained  from 
whole  blood  with  methods  described  in  Ringquist,14  and  genotyp¬ 
ing  was  performed  by  Affymetrix  Services  Laboratory  (Affymetrix) 
with  GeneChip  500K  arrays.  All  of  the  genotype  data  from  GoKinD 
samples  generated  by  this  project  will  be  submitted  to  an  accessible 
database,  such  as  dbGaP  or  TIDbase  (see  Web  Resources). 

All  T1D  samples  had  a  sufficient  completion  rate  (>95%)  for 
inclusion,  as  did  almost  all  KORA  and  PopGen  samples.  Initially, 
genotypes  for  all  three  samples  were  called  with  the  BRLMM  algo¬ 
rithm.15  By  using  three  criteria  for  genotype  QC  per  SNP — greater 
than  90%  genotype  calls,  test  statistic  for  Hardy- Weinberg  yields 
p  value  >  0.005,  and  minor  allele  frequency  >0.05 — we  removed 
-140,000  SNPs  and  retained  360,000  for  the  T1D  sample,  similar 
to  other  studies.  When  we  contrasted  the  T1D  samples  to  the  con¬ 
trol  samples,  we  noted  SNPs  with  very  different  allele  frequencies 
that  were  not  in  or  near  known  T1D  loci.  Inspection  of  the  allele 
frequencies  showed  that  the  control  allele  frequencies  were  re¬ 
markably  similar  to  HapMap  frequencies  (see  Web  Resources), 
but  the  corresponding  genotype  clusters  for  the  T1D  samples 
had  undesirable  features. 

We  tried  various  ways  to  improve  the  genotype  calls.  First,  we 
looked  for  substantial  differences  between  the  calls  by  using  the 
two  algorithms  employed  by  Affymetrix,  namely  DM  and  BRLMM. 
Although  some  discrepancies  were  noted,  we  did  not  see  a  material 
improvement  in  the  data  by  eliminating  this  small  set  of  loci.  Next, 
because  we  had  the  Affymetrix  "cel"  files  for  the  PopGen  control 
sample,  we  called  all  of  these  genotypes  for  PopGen  and  T1D 
together  by  using  both  the  DM  and  BRLMM  algorithms.  Again, 
this  process  eliminated  some  problematic  loci,  but  the  results 
were  not  compelling.  Finally,  we  tried  the  new  Bayesian  calling 
algorithm,  CHIAMO.16  This  algorithm  led  to  a  marked  improve¬ 
ment  for  the  genotype  calls,  as  determined  by  inspection  of  the  ge¬ 
notype  clusters.  For  our  data,  we  found  that  analyzing  the  PopGen 
and  T1D  data  together  (batch)  yielded  slightly  better  results  than 
analyzing  the  two  data  sets  as  complementary  strata,  so  these 
were  the  data  we  reported.  Because  we  had  greater  confidence  in 
the  BRLMM  calls  for  chromosome  X,  we  reported  those  calls  for 
X-Iinked  SNPs.  Because  the  KORA  sample  came  to  us  only  with 


The  American  Journal  of  Human  Genetics  82,  453-463,  February  2008  455 


genotypes  called  by  the  BRLMM  algorithm,  we  used  those  geno¬ 
types  for  that  data  set. 

Preliminary  quality  control  consisted  of  a  six-step  process  that 
reduced  the  number  of  cases  to  415,  controls  to  2112,  and  SNPs 
to  284,216.  Step  1:  Removed  a  case  who  was  a  clear  outlier.  Step  2: 
Removed  32  controls  who  had  greater  than  5%  missing  genotypes. 
Step  3:  Removed  90,732  SNPs  with  >5%  noncall  rate  in  at  least 
one  of  the  three  samples.  Step  4:  Removed  105,658  SNPs  with  mi¬ 
nor  allele  frequencies  less  than  0.05  in  either  control  sample.  Step 
5:  Removed  1972  SNPs  with  FSt  >  0.02  (estimated  for  the  two  Ger¬ 
man  control  samples).  Step  6:  Removed  18,427  SNPs  that  violated 
Hardy-Weinberg  equilibrium  (p  <  0.005)  in  either  of  the  control 
samples. 


Results 

Simulations 

We  compare  three  approaches  to  correct  for  the  effects  of 
structure:  Eigenstrat  and  GEM  with  fMatch  and  pMatch. 
Although  we  compare  their  size  (i.e.,  rate  of  false  positives) 
and  power,  these  approaches  are  not  direct  competitors. 
The  GEM  methods  are  designed  to  limit  analysis  to  strata 
that  are  chosen  a  priori,  whereas  Eigenstrat  aims  to  remove 
the  effects  of  structure  in  the  analysis  stage. 

Allele  frequencies  for  the  subpopulations  were  generated 
with  the  "Balding-Nichols"  model17  (see  Appendix),  with 
allele  frequencies  varying  uniformly  between  0.05  and 
0.5.  To  correct  for  structure,  L  reference  SNPs  were  gener¬ 
ated.  Of  these  SNPs,  99%  had  a  minor  amount  of  variability 
across  subpopulations  (Fst  =  0.01),  and  1%  had  substantial 
differentiation  ( Fst  =  0.1).  Null  or  causal  candidate  SNPs  of 
three  levels  of  Fst  were  generated:  Model  (!)  strongly 
differentiated  SNPs,  FSt  =  0.1;  Model  (2)  moderately  differ¬ 
entiated  SNPs  with  Fst  =  0.03;  and  Model  (3),  modestly  dif¬ 
ferentiated  SNPs  with  Fst  =  0.01. 

Ten  panels  of  independent  reference  SNPs,  with  L  rang¬ 
ing  from  96  to  100,000,  were  generated.  For  each  of  these 
panels,  we  simulated  1000  independent  causal  SNPs  and 
1000  independent  null  candidate  SNPs.  We  repeated  this 
analysis  for  models  (1),  (2)  and  (3)  and  for  six  choices  of 
L.  Causal  SNPs  with  relative  risk  R  =  2  were  generated 
with  the  approach  described  in  Price4  for  power  calcula¬ 
tions. 

Our  first  battery  of  simulations  is  based  on  SNPs  sampled 
from  two  subpopulations,  with  200  individuals  per  sub¬ 
population.  Case  status  was  assigned  to  80  and  20  of  the 
individuals  from  subpopulations  1  and  2,  respectively. 
The  remaining  individuals  were  assigned  control  status. 
For  the  matched-pairs  analysis,  we  paired  each  case  to 
the  closest  control  until  we  obtained  100  matched  pairs. 
For  the  other  two  methods  of  analyses,  we  analyzed  all 
400  individuals.  Each  method  readily  detects  population 
substructure  and  achieves  the  desired  type  I  error  rate  as 
L  increases  (Table  2).  pMatch  and  fMatch  successfully 
remove  the  effect  of  structure  with  a  smaller  panel  of  refer¬ 
ence  SNPs  than  Eigenstrat  does  (Figure  2A).  Indeed,  when 
a  large  panel  of  reference  SNPs  is  available,  the  GEM  proce¬ 


dures  are  overly  conservative;  consequently,  Eigenstrat  is 
slightly  more  powerful  than  both  matching  procedures 
(Table  2)  under  these  conditions.  For  SNPs  with  less  infor¬ 
mation  about  population  membership  than  present  in  our 
simulated  reference  panels,  greater  numbers  of  SNPs  would 
be  required  to  remove  the  effects  of  structure.4 

Our  second  battery  of  simulations  is  based  on  nine 
subpopulations  distributed  along  a  gradient,  designed  to 
simulate  a  dine  such  as  the  north  to  south  dine  observed 
in  western  Europe.  The  100  cases  are  distributed  with  2,  4, 
6,  7,  9,  12,  15,  20,  and  25  individuals  in  populations  1-9, 
respectively.  The  300  controls  are  distributed  randomly 
across  the  nine  subpopulations.  Results  from  this  simula¬ 
tion  are  qualitatively  similar  to  those  shown  in  Figure  2A 
(Table  2).  The  first  two  batteries  of  simulations  illustrate 
that  when  the  case  and  control  samples  are  drawn  from 
the  same  subpopulations,  but  with  different  frequencies, 
the  effects  of  substructure  can  be  removed  with  any 
of  the  three  methods  described.  Even  the  effects  of  highly 
differentiated  SNPs  can  be  removed  provided  the  reference 
panel  is  sufficiently  informative. 

Our  third  battery  of  simulations  is  also  based  on  a  nine 
population  gradient;  however,  the  cases  and  controls  are 
apportioned  in  a  manner  that  simulates  the  complexity 
of  human  populations  and  GWA  designs.  As  in  the  previ¬ 
ous  simulation,  we  simulate  nine  populations  and  draw 
300  controls  randomly.  In  contrast,  all  50  of  the  cases  are 
drawn  from  populations  6-9.  Because  of  the  nature  of 
this  third  battery,  namely  the  presence  of  unmatchable 
observations,  we  analyze  the  data  in  two  ways:  Unmatch¬ 
able  observations  are  removed  as  described  previously;  or 
unmatchable  observations  are  retained.  In  choosing  only 
a  single  control  for  each  case,  pMatch  includes  only  50 
of  the  controls  in  the  study  regardless  of  the  treatment  of 
outliers.  Provided  the  reference  panel  is  sufficiently  infor¬ 
mative,  many  of  these  controls  will  be  derived  from  popu¬ 
lations  6-9.  Eigenstrat,  on  the  other  hand,  uses  all  of 
the  data,  as  will  fMatch  when  unmatchable  observations 
are  retained.  For  fMatch,  this  means  that  cases  drawn 
from  population  6  will  tend  to  have  many  controls  in  their 
strata  sampled  from  populations  1-5.  The  remaining  cases 
will  tend  to  have  only  one  or  two  controls  in  their  strata. 
By  grouping  the  outlying  observations,  fMatch  attempts 
to  minimize  the  effect  of  unmatchable  observations.  Ei¬ 
genstrat  must  account  for  controls  sampled  from  pop¬ 
ulations  1-5  with  regression  techniques,  which  are  well 
known  to  suffer  adverse  consequences  when  they  are 
extrapolating  beyond  the  range  of  the  data. 

When  unmatchable  observations  are  retained,  pMatch 
corrects  for  the  effects  of  substructure  with  fewer  reference 
SNPs  than  the  other  two  methods  (Table  3  and  Figure  2B). 
Indeed,  Eigenstrat  fails  to  remove  the  effects  of  population 
substructure.  By  comparing  pMatch  and  fMatch,  we  see 
that  the  latter  has  greater  power.  This  makes  sense  because 
fMatch  is  using  more  of  the  data  (Table  3). 

On  the  basis  of  the  clustering  and  rescaling  process,  most 
of  the  controls  from  populations  1-5  are  unmatchable,  and 
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Table  2.  Size  and  Power  of  Tests  at  Level  0.05 


Eigenstrat  with  Fst  pMatch  with  Fst  fMatch  with  Fst 


Statistic  Design 

No.  of  Markers 

0.01 

0.03 

0.1 

0.01 

0.03 

0.1 

0.01 

0.03 

0.1 

Size 

Two  Populations 

96 

.069 

.106 

.211 

.062 

.100 

.202 

.065 

.101 

.206 

386 

.055 

.061 

.085 

.044 

.045 

.051 

.047 

.049 

.054 

1536 

.052 

.054 

.055 

.046 

.045 

.045 

.047 

.047 

.046 

6144 

.053 

.052 

.051 

.044 

.045 

.045 

.047 

.047 

.046 

12000 

.053 

.052 

.052 

.044 

.044 

.045 

.047 

.045 

.047 

24000 

.053 

.050 

.051 

.043 

.042 

.041 

.046 

.046 

.045 

Gradient 

96 

.069 

.109 

.221 

.049 

.067 

.109 

.061 

.097 

.201 

386 

.054 

.058 

.071 

.043 

.046 

.048 

.048 

.052 

.063 

1536 

.052 

.051 

.050 

.045 

.044 

.044 

.047 

.047 

.046 

6144 

.052 

.051 

.050 

.045 

.045 

.044 

.046 

.045 

.047 

12000 

.052 

.052 

.052 

.044 

.044 

.045 

.047 

.047 

.047 

24000 

.052 

.052 

.052 

.044 

.045 

.044 

.047 

.046 

.046 

Power 

Two  Populations 

96 

.783 

.710 

.683 

.693 

.635 

.620 

.754 

.685 

.659 

386 

.767 

.701 

.682 

.682 

.632 

.621 

.735 

.673 

.653 

1536 

.766 

.702 

.677 

.683 

.635 

.622 

.736 

.674 

.653 

6144 

.765 

.694 

.676 

.684 

.630 

.623 

.735 

.671 

.653 

12000 

.765 

.697 

.676 

.684 

.633 

.624 

.735 

.673 

.653 

24000 

.763 

.696 

.677 

.684 

.632 

.624 

.734 

.671 

.653 

Gradient 

96 

.939 

.917 

.833 

.886 

.872 

.804 

.922 

.900 

.814 

386 

.924 

.891 

.796 

.877 

.857 

.782 

.902 

.869 

.775 

1536 

.917 

.876 

.774 

.876 

.850 

.775 

.894 

.856 

.754 

6144 

.913 

.874 

.768 

.873 

.849 

.773 

.892 

.853 

.747 

12000 

.915 

.873 

.771 

.874 

.849 

.774 

.891 

.850 

.749 

24000 

.912 

.874 

.768 

.873 

.849 

.771 

.892 

.852 

.747 

Columns  depict  the  results  as  Fst  varies  (0.01,  0.03,  and  0.1)  in  the  candidate  markers.  Results  are  shown  for  two  scenarios: 

a  two-population 

mixture  and 

a  nine-population  gradient.  For  the  size,  the  expected  number  of  p  values  smaller  than  0.05  is  50. 

such  a  result  is  desirable  because  cases  were  only  drawn  from 
populations  6-9.  In  this  instance,  the  size  of  the  matched 
analyses  is  now  closer  to  the  nominal  level  even  when  L  is 
small,  as  expected.  Interestingly,  there  is  the  considerable 
enhancement  in  power  for  fMatch  and  pMatch  when 
unmatchable  individuals  are  removed,  as  recommended 
by  our  methods,  as  opposed  to  when  they  are  forced  to  be 
retained  (Table  3).  This  occurs  because  removal  of  the  out¬ 
liers  leads  to  improved  performance  of  the  EVD  and  hence 
superior  choices  of  matches  in  the  analysis.  In  addition, 
for  fMatch  the  removal  of  controls  from  populations  1-5 
leads  to  a  more  homogeneous  sample  that  tends  to  increase 
power. 

Eigenstrat  defines  outliers  without  specific  reference  to 
cases  and  controls;  thus,  none  of  the  observations  are 
unmatchable  observations.  Nevertheless,  if  the  regression 
approach  is  applied  after  removal  of  those  observations 
declared  unmatchable  by  the  fMatch  procedure,  the 
type  I  error  is  successfully  controlled,  and  the  power  is 
slightly  greater  than  it  is  for  fMatch  (Table  3).  This  hybrid 
approach  to  analysis  has  some  potential  for  further 
development. 


GWA  of  Type  1  Diabetes  Data  with  fMatch 

We  analyzed  416  cases  of  T1D,18  derived  from  the  Go- 
KinD10  cohort  (n  =  394)  and  T1D  patients  recruited  from 
the  Children's  Hospital  of  Pittsburgh  (n  =  28).  Samples 
were  genotyped  with  the  Affymetrix  500K  GeneChip.  All 
identified  their  ancestors  as  European.  The  mean  age  of  on¬ 
set  for  T1D  was  12.2  and  12.7  years  of  age  for  the  GoKinD 
and  Pittsburgh  cohorts,  respectively.  Controls  genotyped 
by  the  same  chip  were  obtained  from  the  PopGen  and 
KORA  repositories,  which  consist  of  500  individuals  from 
north  Germany  (PopGen)  and  1644  individuals  from 
southern  Germany  (KORA). 11-13,19  The  four  cohorts  were 
recruited  independently  of  one  another.  The  relevant  char¬ 
acteristics  of  these  cohorts  are  summarized  in  Table  1 . 

Stringent  quality  control  reduced  the  number  of  SNPs  to 
284,216  and  the  number  of  controls  to  2112  (samples  were 
removed  if  the  rate  of  missing  genotypes  exceeded  5%). 
To  reconstruct  ancestry,  we  chose  23,552  independent  or 
"tag”  SNPs  by  using  the  H-clust  algorithm20  with  an  r2  cut¬ 
off  value  of  0.04.  Both  case  and  control  individuals  exhibit 
complex  population  heterogeneity.  For  example,  individ¬ 
uals  were  included  in  the  PopGen  and  KORA  registry  on 
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Figure  2.  False-Positive  Rate  versus  Log  of  the  Number  of 
Markers  Available  for  Estimating  Structure 

Results  are  for  Eigenstrat  (black),  pMatch  (bLue),  and  fMatch  (red). 
The  desired  nominal  rate  of  0.05  is  plotted  as  a  yellow  line.  In  (A), 
a  sample  derived  from  two  simulated  populations  is  shown.  Results 
are  displayed  for  markers  with  two  levels  of  differentiation 
Fst  =  0.1(*)  and  0.03  (+).  The  former  exhibits  a  higher  rate  of 
errors  than  the  latter  for  small  numbers  of  markers.  In  (B),  a  sample 
derived  from  a  gradient  of  simulated  populations  is  shown.  Results 
are  displayed  for  the  full  sampLe  (plotting  character  "o")  and  with 
unmatchable  individuals  removed  (plotting  character  this 
applies  to  the  matching  methods  only). 

the  basis  of  residence  rather  than  known  German  ancestry. 
We  removed  one  case  individual  who  had  very  different 
ancestry  from  the  other  415.  For  415  cases  and  2112  con¬ 
trols,  D  =  22  dimensions  were  required  to  explain  the 
significant  axes  of  genetic  variation.  Many  of  these  axes 
exhibited  extreme  outliers  (Figure  1A).  After  removing  53 
controls,  only  three  important  axes  of  variation  remained. 
On  the  basis  of  the  first  two  eigenvectors,  a  duster  of  cases 
that  differs  in  ancestry  from  the  control  sample  was  clearly 
evident  (Figure  3A).  To  identify  unmatchable  individuals 
more  completely,  we  computed  the  distance  between 
each  case  and  the  nearest  control  and  vice  versa  on  the  ba¬ 
sis  of  three  axes  of  the  EVD  map.  The  resulting  distribution 
of  distances  indicated  that  21  cases  could  not  be  matched 
to  a  control  with  similar  ancestry  (Figure  4).  By  repeating 
this  process  of  finding  the  significant  eigenvalues  and  the 
corresponding  minimum  distances  between  cases  and  con¬ 
trols  in  the  corresponding  axes,  we  subsequently  removed 
an  additional  one  case  and  15  controls.  After  excluding 
these  outliers,  only  two  significant  eigenvalues  remain 
when  a  significance  level  of  0.01  was  used. 

Next,  with  cluster  analysis  to  identify  homogeneous 
strata,  2136  individuals  were  clustered  into  26  strata,  each 
with  20  or  more  elements  and  no  significant  structure 
within  cluster  (p  >  0.001).  The  remaining  301  individuals 
were  clustered  into  24  small  clusters.  On  the  basis  of  these 
strata,  the  data  were  rescaled  and  the  distance  between 
cases  and  matched  controls  was  determined.  Those  that 


Table  3.  Size  and  Power  of  the  Tests  before  and  after 
Removing  Outliers,  with  Eigenstrat,  pMatch,  and  fMatch 

Outliers  Present 

Eigenstrat  with  pMatch  fMatch 

Fst  with  Fst  with  Fst 

Statistic  No.  of  Markers  .01  .03  .1  .01  .03  .1  .01  .03  .1 

Size 


96  .064  .097  .206  .044  .056  .107  .056  .082  .176 

386  .057  .068  .121  .038  .040  .050  .045  .051  .070 

1536  .056  .062  .103  .037  .037  .037  .043  .042  .045 

6144  .056  .061  .085  .037  .037  .035  .041  .042  .041 

12000  .058  .058  .073  .037  .036  .036  .042  .040  .041 

24000  .057  .058  .067  .037  .037  .035  .042  .042  .040 

100000  .055  .057  .064  .037  .037  .034  .043  .043  .042 

Power 


96  .804  .753  .650  .590.579.511.770.726.623 

386  .784  .731  .630  .583  .566  .489  .721  .686  .583 

1536  .771  .716  .615  .581  .567  .482  .671  .642  .548 

6144  .762  .711  .604  .583.566.485.639.620.531 

12000  .751  .704  .595  .582.564.485.637.615.528 

24000  .746  .699  .593  .584  .565  .484  .637  .613  .529 

100000  .748  .694  .592  .588  .565  .484  .639  .612  .528 

Outliers  Removed 

Size 


96  .061  .090  .195  .037  .036  .035  .044  .043  .042 

386  .057  .060  .095  .036  .036  .035  .043  .041  .043 

1536  .054  .053  .057  .035  .038  .033  .040  .044  .041 

6144  .054  .053  .056  .040  .037  .037  .044  .044  .043 

12000  .052  .053  .054  .038  .035  .033  .041  .042  .041 

24000  .052  .052  .053  .036  .039  .034  .041  .045  .041 

100000  .052  .052  .053  .037  .035  .034  .042  .042  .042 


Power 


96  .906  .931  .927  .706  .713  .656  .772  .776  .719 

386  .873  .885  .870  .698  .713  .656  .771  .769  .726 

1536  .856  .857  .834  .700.716.660.771.774.727 

6144  .849  .850  .829  .703  .716  .666  .771  .774  .729 

12000  .843  .843  .818  .703  .713  .663  .769  .767  .726 

24000  .840  .840  .817  .701  .715  .667  .771  .775  .726 

100000  .835  .834  .813  .700  .719  .669  .772  .776  .728 

Columns  depict  the  results  as  Fst  varies  (.01,  .03,  and  .1)  in  the  candidate 
markers.  The  simulated  data  are  a  gradient  with  nine  subpopulations; 
controls  are  drawn  from  1-9  and  cases  are  only  from  6-9. 


were  considered  unmatchable  individuals  on  the  basis  of 
the  simulation  results  were  removed  (see  Appendix).  With 
this  process,  an  additional  20  cases  and  48  controls  are 
removed  from  the  dataset  for  fMatch.  The  resulting  dis¬ 
tance  between  the  remaining  cases  and  controls  in  fMatch 
is  consistent  with  expectations  for  cases  and  controls 
matched  within  homogeneous  strata  (data  not  shown).  In 
the  reduced  fMatch  sample,  two  principal  axes  separate 
the  German  control  samples  by  region  and  define  a  space, 
spanned  by  both  cases  and  controls,  that  facilitates  match¬ 
ing  (Figure  3B).  These  dimensions  presumably  map  onto 
genetic  gradients  on  the  European  continent;  e.g.,  the  hor¬ 
izontal  axis  is  likely  to  be  related  to  a  north-south  gradient 
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Figure  3.  Plots  of  the  First  Two  Eigen¬ 
vector  Axes  forTID  Data  before  and  after 
Removing  the  Unmatchable  Individuals 
after  Clustering  and  Rescaling  of  the 
Data 

Each  case  (light  bLue)  has  a  matched  con¬ 
trol  (dark  blue  =  South  Germany,  red  = 
North  Germany)  in  a  close  neighborhood 
after  removal  of  unmatchabLe  individuals; 
compare  before  (A)  with  after  (B). 
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Eigenvector  1 


because  it  tends  to  separate  the  German  samples  by  north 
(PopGen)12  and  south  (KORA)11  origin.21,22  In  the  pMatch 
sample,  one  additional  axis  is  needed  to  explain  important 
variation  (data  not  shown). 

After  final  removal  of  outliers  and  unmatchable  individ¬ 
uals  for  fMatch,  cases  and  controls  were  stratified  on  the 
basis  of  their  genetic  ancestry  into  298  strata.  Most  of  the 
strata  (159)  contain  a  single  case  matched  to  several  con¬ 
trols.  A  single  case  matched  to  a  single  control  occurred 
in  1 1 1  strata.  A  minority  of  strata  (28)  contain  a  single  con¬ 
trol  matched  to  multiple  cases.  For  example,  in  the  most 
extreme  strata,  a  single  case  was  matched  to  71  controls 
and  a  single  control  was  matched  to  13  cases.  When  a  sin¬ 
gle  case  is  matched  to  a  large  number  of  controls  (or  vice 
versa),  the  information  gain  from  the  strata  is  essentially 
equivalent  to  that  obtained  from  a  single  case  matched 
to  a  moderate  number  of  individuals.  Nevertheless,  condi¬ 
tional  logistic  regression  is  valid  regardless  of  the  lack  of 


(a)  Distance  from  Cases  to  Controls 
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(b)  Distance  from  Controls  to  Cases 
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Figure  4.  The  Distance  between  Each  Case  and  the  Nearest 
Control  and  Vice  Versa  Based  on  Three  Principal  Components 
Are  Computed 

The  distributions  differ,  and  we  eliminate  34  cases  with  distances 
to  the  nearest  control  greater  than  0.075.  (A)  shows  the  histogram 
of  distances  between  each  case  and  the  nearest  control.  (B)  shows 
the  histogram  of  distances  between  each  control  and  the  nearest 
case. 


balance  in  the  strata.  In  all,  373  cases  were  contrasted 
with  1996  controls  by  conditional  logistic  regression  (Fig¬ 
ure  5,  top  panel).  The  results  highlight  the  HLA  region, 
which  contains  numerous  SNPs  achieving  GWA  signifi¬ 
cance.  Variation  in  the  HLA  region  is  well  known  to 
account  for  a  large  fraction  of  the  risk  for  T1 D.2  1-26  No 
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Figure  5.  Transformed  p  Values  after  Conditional  Logit 
Regression  Was  Performed  on  the  Data  Stratified  with  fMatch 

Transformed  p  values  (negative  of  the  log,  base  10).  ResuLts  from 
conditional  logistic  regression  on  the  data  stratified  with  fMatch 
(top  panel)  and  pmatch  (second  panel)  are  shown.  Results  ob¬ 
tained  with  Eigenstrat  are  shown  in  the  third  panel.  Results  ob¬ 
tained  when  removing  observations  with  very  divergent  ancestries 
(inferred  with  the  Eigenstrat  rule  for  outliers)  from  the  bulk  of  the 
sample,  which  was  European,  are  shown  in  the  bottom  panel. 
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other  location  in  the  genome  contains  SNPs  with  test 
statistics  meeting  reasonable  criteria  for  GWA  significance 
(<1CT7)  after  ensuring  quality  genotype  calls  by  visual 
inspection  of  the  genotype  clusters  (see  Figures  SI  and  S2 
available  online  for  examples).  It  should  be  noted,  how¬ 
ever,  that  visual  inspection  of  genotype  clusters  is  essential 
to  interpret  this  Affymetrix  "first-generation"  genotype 
data,  a  feature  other  GWA  studies  with  this  genotyping 
platform  also  report.27 

Results  from  fMatch  agree  with  our  expectations.  Other 
GWA  studies  have  established  that  all  genetic  variation 
thus  far  uncovered,  aside  from  variation  in  the  HLA  region, 
account  for  a  modest  portion  of  the  risk  for  T1D. 23,29 
For  detecting  loci  of  modest  effect  with  good  power,  either 
sample  sizes  must  be  substantial  (i.e.,  thousands  of  cases 
and  controls  genotyped)  or  a  staged  study  design  must  be 
employed.  The  staged  design  typically  sets  a  significance 
level  between  0.01  and  0.001  in  stage  1,  then  genotypes 
all  loci  meeting  this  significance  level  (and  quality-control 
criteria)  in  a  second,  larger  sample.30,31  Treating  our  study 
as  stage  1  with  a  significance  level  of  0.007, 30  results  from 
fMatch  would  include  SNPs  for  genotyping  in  stage  2  from 
six  out  of  ten  loci  now  believed  to  confer  risk  to  T1D.29  Of 
the  remaining  four  loci,  only  one  had  more  than  a  few 
SNPs  in  the  region. 

Aside  from  the  HLA  region,  SNPs  in  or  near  PTPN22  (MIM 
600716),  IL2RA  (CD25  [MIM  147730]),  and  CTLA4  ([MIM 
123890];  window  =  gene  location  ±  40  Kb)  showed 
enough  signal  to  be  passed  to  stage  2.  The  smallest  p  value 
for  each  gene  was  0.000706  (rs2488457),  0.000995 
(rsl0905669),  and  0.000336  (rs231726).  The  smallest 
p  values  for  SNPs  close  to  "risk  SNPs"  rs2292239  and 
rsl2708716  were  0.00667  (rs2292239)  and  0.00539 
(rsll647011)  for  window  =  SNP  location  ±  50  Kb.  Could 
it  be  that  the  signals  in  these  regions  occurred  by  chance? 
To  answer  this  question,  we  performed  a  simulation  exper¬ 
iment.  We  randomly  select  from  the  genome  ten  inter¬ 
vals  that  correspond  to  the  same  size  as  the  original  ten 
windows  (for  the  HLA  region,  we  assumed  a  window  of 
3  Mb).  Then,  we  count  the  number  of  intervals  in  which 
one  or  more  SNPs  have  p  <  0.007  and  would  thus  be  geno¬ 
typed  in  stage  2.  We  perform  this  random  selection 
106  times,  counting  how  many  times  six  or  more  intervals 
would  have  SNPs  genotyped  in  Stage  2.  By  this  experiment, 
we  determined  that  our  results  would  rarely  occur  by 
chance,  roughly  one  in  ten  thousand  times. 

A  few  other  observations  from  these  analyses  are  worth 
noting.  Within  the  HLA  region,  Todd29  cites  rs3129934  as 
the  replicated  SNP;  our  independent  data  and  analyses  yield 
a  p  value  for  association  of  7.2  x  lCT10with  this  SNP;  forthe 
replicated  SNP  identified  in  CTLA4,  rs3087243,  our  data 
and  analyses  yield  a  p  value  for  association  of  0.013,  and, 
as  noted  above,  the  replicated  SNP  rs2292239  produced 
a  p  value  of  0.00667  from  our  data.  Although  the  HLA 
region  needs  no  more  support,  our  results  provide  further 
evidence  for  replication  in  CTLA4  and  at  rs2292239.  For 
genotype  cluster  plots  for  the  cited  SNPs,  see  Figure  SI.  In 


addition,  for  all  of  the  loci  cited  above,  we  have  compared 
our  data  to  that  reported  by  the  Wellcome  Trust  Case 
Control  Consortium.27  For  these  loci,  the  allele  in  excess 
in  cases  is  the  same  for  both  data  sets  (data  not  shown). 

Four  loci  did  not  pass  stage  1  criteria.  None  of  these  SNPs 
reported  by  Todd  et  al.29  as  risk  loci  were  on  our  Affymetrix 
genotyping  array.  Of  these  four  risk  SNPs,  only  rsl893217 
in  18pll  was  covered  well  in  terms  of  genotyped  SNPs  in 
substantial  linkage  disequilibrium  (LD)  with  it.  This  SNP 
is  in  almost  complete  LD  with  rs2542151  according  to 
HapMap;  it  passed  our  QC,  but  it  shows  no  evidence  for 
association  in  our  data  (p  =  0.51).  For  the  proinsulin 
precursor  gene,  INS  (MIM  176730),  only  two  SNPs  on  the 
array  pass  QC  and  fall  in  the  region,  but  HapMap  contains 
no  information  about  their  LD  with  the  reported  risk  SNP, 
rs689,  and  they  show  no  association  (p  >  0.35).  For  the 
gene  encoding  interferon-induced  helicase  C  domain-con¬ 
taining  protein  1,  IFIH1  (MIM  606951),  the  reported  risk 
allele  shows  modest  LD  with  a  SNP  we  genotyped,  namely 
rs7608315,  which  shows  no  association  (p  =  0.38).  Finally, 
for  the  12q24  region,  rs3184504  is  identified  as  the  risk 
SNP.  One  SNP  in  this  region  passed  QC  for  our  data,  and 
it  is  modestly  associated  with  risk  for  T1D  (p  =  0.046). 

The  vast  majority  of  the  SNPs  from  this  or  any  relevant 
GWA  are  independent  of  risk  for  T1D.  Many  SNPs  from 
the  HLA  regions  of  chromosome  6  are  associated,  however. 
After  eliminating  HLA  SNPs,  -5%  of  the  association  tests  are 
expected  to  have  p  values  <0.05.  Of  the  284,216  tests,  7.0% 
were  significant  at  a  =  0.05  for  fMatch.  A  moderate  excess  of 
false  positives  occurs  for  any  reasonable  choice  of  a.  Given 
the  success  of  the  GEM  method  in  the  simulations,  in  terms 
of  controlling  the  false-positive  rate,  we  wondered  whether 
the  source  of  additional  false  positives  could  be  poor-qual¬ 
ity  genotype  calls.  Indeed,  by  assessing  genotype  clusters 
for  all  SNPs  producing  p  values  <  10  4,  we  find  a  rate  of 
poor  calls  of  60%-67%  (Figure  S2).  The  rate  of  poor-quality 
genotype  calls  increases  as  the  p  value  decreases.  Predomi¬ 
nantly,  the  problematic  calls  occur  for  the  T1D  sample. 
On  the  basis  of  our  estimated  rate  of  poor-quality  genotype 
calls,  we  believe  the  excess  false-positive  rate  is  attributable 
to  data-quality  issues,  not  the  method. 

We  also  analyzed  GWA  data  by  using  pMatch  and  Eigen- 
strat  and  by  ignoring  population  substructure  after  discard¬ 
ing  outliers  with  the  Eigenstrat  rule  (see  Figure  5).  As  ex¬ 
pected,  pMatch  shows  the  lowest  rates  of  positive  findings, 
whereas  ignoring  structure  yields  the  most.  Like  fMatch, 
it  appears  the  excess  of  false  positives  for  pMatch  is  due  to 
poor-quality  genotype  calls.  The  same  is  predominantly 
true  for  Eigenstrat,  but  we  note  that  an  ample  number  of 
SNPs  producing  small  p  values  are  not  attributable  to  poor 
quality,  and  this  problem  is  amplified  by  ignoring  structure. 
At  significance  level  0.0001,  after  visual  inspection  of  geno¬ 
types  fMatch  has  half  the  false-positive  rate  of  Eigenstrat. 

To  further  validate  GEM,  we  tried  a  null  experiment.  We 
randomly  labeled  half  of  the  KORA  data  as  cases  and  half  as 
controls  and  repeated  the  matching  analysis.  Removal  of 
72  outliers  reduced  the  number  of  significant  eigenvalues 
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required  to  explain  the  variation  from  24  to  2.  After  this 
simplification,  only  12  unmatchable  individuals  remained. 
All  three  methods  of  analysis  (Eigenstrat,  Pmatch,  and 
fMatch)  produced  type  I  error  rates  that  were  on  target. 

Discussion 

Our  GWA  analyses  of  T1D  are  meant  to  accomplish  two 
goals.  First,  they  illustrate  the  utility  of  ancestry  matching 
in  the  face  of  a  very  difficult  problem,  that  being  when  cases 
are  sampled  in  a  region  quite  different  from  the  region  of 
the  controls.  In  our  case,  the  T1D  sample  comes  from  any 
American  of  nominal  European  ancestry,  whereas  the  con¬ 
trols  were  recruited  among  residents  of  Germany.  Such  con¬ 
stellations  can  also  arise  even  if  cases  and  controls  are  sam¬ 
pled  from  the  same  geographical  region.  We  would  expect 
the  example  to  be  especially  salient  for  American  samples. 
Second,  we  wished  to  use  the  results  to  evaluate  reported 
T1D  risk  loci  and,  in  later  analyses,  discover  new  loci.  The 
results  show  that  genetic  or  ancestry  matching  can  be  an 
important  ingredient  in  the  toolbox  of  researchers  who 
are  performing  GWA  analyses.  Moreover,  our  results  do 
lend  support  for  previous  GWA  findings  for  T1D.28,29 

We  do  not  yet  know  whether  our  analyses  have  identified 
any  new  risk  loci  for  T1D.  Although  it  seems  unlikely  given 
the  modest  sample  of  cases,  a  substantial  number  of  con¬ 
trols  have  been  analyzed.  Moreover,  for  a  rare  disease  like 
T1D,  using  unscreened  instead  of  screened  controls  has  al¬ 
most  no  impact  on  power.82  We  plan  various  kinds  of  stage 
2  analyses  to  assess  the  association  signals  from  our  GWA 
results.  In  addition,  by  agreement  the  data  generated  by 
our  project  will  be  reported  back  to  the  GoKinD  database, 
and  GoKinD  will  make  the  data  available  to  qualified  in¬ 
vestigators.  Thus,  these  data  will  shortly  be  available  to 
the  research  community,  and  we  will  be  pleased  to  share 
detailed  results  upon  request. 

We  have  described  how  to  use  genetic  matching  to  en¬ 
hance  a  case-control  study.  We  note,  however,  that  these 
methods  can  also  be  used  for  the  analysis  of  quantitative 
traits.  Once  homogeneous  clusters  are  identified,  they 
can  be  entered  into  a  model  as  block  effects,  and  the  quan¬ 
titative  trait  can  be  analyzed  with  standard  statistical  tools, 
such  as  analysis  of  variance. 

Theory,  simulations,  and  real-data  analyses  suggest  that 
genetic  matching  is  useful  and  powerful  for  GWA,  espe¬ 
cially  when  the  samples  of  cases  and  controls  cannot 
be  guaranteed  to  be  drawn  from  the  same  population.  It 
can  diminish  the  false-positive  rate,  sometimes  substan¬ 
tially,  and  have  only  modest  impact  on  power.  Among 
others,33-36  methods  similar  to  Eigenstrat4  also  limit  the 
impact  of  population  structure,  but  for  challenging  de¬ 
signs,  they  cannot  be  expected  to  completely  control  the 
false-positive  rate.  Perhaps  the  gold  standard  for  GWA 
studies  should  be  to  evaluate  the  data  with  both  regression 
methods  such  as  Eigenstrat  and  epidemiological  methods 
such  as  fMatch.  When  the  results  of  these  methods  agree, 


researchers  have  greater  assurance  of  validity;  it  is  when 
the  results  diverge  that  we  should  be  wary. 

Appendix 

EVD  of  Allele  Counts 

Using  allele  counts  for  SNPs  l  =  1,  ...,  L,  and  individuals 
i  =  1,  N,  create  an  N  x  L  matrix  X.  For  pi,  the  Zth  allele 
frequency,  center  allele  counts  in  column  /  by  subtracting 
2 pi  and  scale  by  dividing  by  (2p/(l  —  pi))1/2.  Find  the  EVD 
of  XX1  =  U XU1.  In  the  D  dimensional  space  defined  by 
the  top  D  eigenvectors,  the  "ancestry"  value  for  the  zth  sub¬ 
ject  is  determined  by  the  zth  row  of  the  eigenvectors  zz,j, . . ., 
uiD.  The  c/th  eigenvalue,  Xrf,  determines  the  scaling  of  dis¬ 
tances  in  the  dth  dimension.  These  coordinates  are  used 
for  matching. 

Model  for  Population  Stratification 

The  mean  of  allele  frequencies  from  a  set  of  populations  is 
assumed  to  be  the  allele  frequency  of  an  ancestral  popula¬ 
tion.  Individual  populations  have  each  diverged  from  the 
ancestral  population  over  time,  with  fixation  index  Fst, 
a  measure  of  population  differentiation.  Within  a  subpopu¬ 
lation  /,  suppose  that  allele  counts  are  independent  and 
identically  distributed  and  that  allele  a  is  drawn  with  prob¬ 
ability  pj.  If  X  is  counting  allele  a,  then  X  ~  Binomial(2,py). 
Let  P  be  the  random  variable  that  varies  across  subpopu¬ 
lations,  with  pj  as  the  realized  value  in  subpopulation 
j:  P  ~  Beta(ai,a2),  a\  +  «2  =  1  /Fst  —  1.  Assume  that  we 
have  the  minor  allele  frequencies  of  an  ancestral  popula¬ 
tion  p.loci  (in  our  simulations  p.loci  is  uniform  between 
.05  and  .5)  at  L  loci.  From  the  ancestral  population  /,  sub¬ 
populations  have  been  formed.  By  knowing  Fst,  for  each 
marker  Z  we  can  define  aij  =  p.locii  x  (1/Fst  —  1)  and 
«2 ,i  =  (1  —  p.locii)* (1  /Fst  —  1)  and  generate  the  alleles  as 
described  above.  When  used  in  simulation  studies,  this  is 
often  called  the  Balding-Nichols  model.17  For  simulation 
of  a  dine  (or  a  gradient),  it  is  enough  to  order  pji  so  that 
pi  i  < .  •  •  <pji  for  each  Z. 

Hypothesis  Test  for  Population  Structure 

A  formal  significance  test  for  population  structure  is  based 
on  a  theoretical  result  for  the  eigenvalue  distribution  of 
a  null  sample  covariance  matrix.6,37  For  a  homogeneous 
population,  the  largest  eigenvalue,  properly  normed,  ap¬ 
proximately  follows  the  Tracy- Widom  distribution37 
Wd  =  {^d  ~  Pni.) /aNi.  with  centering  and  scaling  parameters 
that  depend  on  both  N  and  L,  =  ((L  -  1)1/2  +  N1/2)2 
and  aNL  =  ((L  -  1)1/2  +  lW2)(l/((L  -  1)1/2)  +  1/NV2)1/3. 
We  can  test  the  null  hypothesis  of  population  homoge¬ 
neity  against  an  alternative  hypothesis  of  population  het¬ 
erogeneity.  The  sample  covariance  matrix  S  follows  a 
(ZV  —  l)x(N  —  1)  Wishart  distribution.  The  test  for  popula¬ 
tion  structure  will  be  applied  iteratively  (i.e.,  the  leading 
eigenvalue,  then  the  second  and  so  on).  If  we  find  the  first 
d  eigenvalues  X\,...,Xd  to  be  significant,  we  test  Xd+\  as 
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though  S  were  an  (N  —  d  —  1  )x(N  —  d  —  1)  Wishart  matrix. 
If  an  eigenvalue  is  not  significant,  the  smaller  eigenvalues 
will  not  be  significant  either. 

Removing  Unmatchable  Individuals 

EVD  determines  the  distance  between  individuals  on  the 
basis  of  the  top  D  eigenvectors  and  eigenvalues.  To  stabilize 
the  distance  metric,  we  use  the  normed  eigenvalues,  Wd, 
plus  a  constant  a,  chosen  to  ensure  the  weights  are  positive. 
The  distance  between  individuals  i  and  i'  is  calculated  as 
g(i,n  =  {Eli  (Wd  +  a)(uid  -  utdf}y\ 

To  rescale  the  distances,  let  Sk  c  { 1 ,2,...,N)  be  the  indices 
of  individuals  in  the  k'th  cluster.  Let  rk  be  the  number  of  in¬ 
dividuals  in  the  k'th  cluster.  For  scaling  subject  i  e  Sk  ,  we 
use  the  eigenvector  values  ( m,i,...,U;d )  but  not  the  eigen¬ 
values.  Assume  that  the  eigenvector  representation  of 
each  individual  consists  of  an  ancestry  signal  plus  random 
noise.  U(d  =  ,u T  £{d. 

For  homogeneous  data,  because  all  individuals  came 
from  a  common  source,  the  ancestry  signal  is  0  and  the 
representation  consists  simply  of  random  noise  uid  =  eid. 
Our  target  is  to  identify  approximately  homogeneous 
subpopulations  that  have  little  or  no  diversity  for  ancestry. 
If  the  clustering  is  successful,  the  signal  of  each  individual 
in  subset  Sk  can  be  approximated  by  udk  =  J2iczsk  uidirk,  and 
the  noise  can  be  approximated  by  uid  —  udk.  But  notice 
that  EVD  automatically  scales  the  eigenvectors  so  that 
uli  =  1  and  ud  =  0.  A  traditional  sum  of  squares  decom¬ 
position  leads  to 

I=Eum  =  EE  "  «*)2  +  J2  rkTldk’ 

i  k  i  eS*  k 

i.e.,  the  total  sum  of  squares  (SSTotal)  equals  the  sum  of 
squares  attributable  to  random  variation  or  error  (SSError) 
plus  the  sum  of  squares  attributable  to  ancestry  differences 
(SSModel).  Unit  scaling  of  SSTotal  causes  the  distances  be¬ 
tween  individuals  from  heterogeneous  populations  to  be 
uncomparable  to  distances  in  homogeneous  populations. 
For  example,  if  the  sample  derives  from  two  highly  differen¬ 
tiated  populations  so  that  SSError  =  0.01  and  SSModel  = 
0.99,  then  the  expected  distance  between  two  individuals 
with  common  ancestry  is  ~0.01/n.  Alternatively,  if  the 
populations  have  identical  ancestry,  then  the  expected 
distance  between  two  individuals  is  ~l/n.  For  comparing 
to  a  homogeneous  scaling,  we  wish  to  rescale  the  random 
noise  so  that  SSError  is  1.  It  follows  that  the  data  will 
be  rescaled  equivalently  to  homogeneous  data  if  we  set 
cd  =  J2kJ2.  s  (uid~Udk)2  and  rescale  the  data  such  that 
W/rf  =  uid/cd. 

In  practice,  udk  provides  a  good  estimate  of  the  signal 
only  when  the  cluster  size  is  sufficiently  large,  say  greater 
than  10.  Hence,  to  compute  c|,  include  only  those  clusters 
Sk  including  10  or  more  elements  in  the  sum  and  then 
multiply  by  n/  Y2k  irk  -  1)  to  account  for  the  missing  clus¬ 
ters.  Notice  that  we  scale  differently  for  each  of  the  d  di¬ 
mensions  to  stretch  and  shrink  accordingly  to  get  the 
proper  scaling  of  the  data. 


In  the  final  step,  find  the  distances  between  individuals 
with  the  u*id  instead  of  iiid  and  use  the  expected  value  of 
normed  eigenvalues  Wk,...,WD  obtained  from  the  simula¬ 
tion,  instead  of  the  actual  eigenvalues.  Match  rescaled  data 
with  fMatch  or  pMatch  and  measure  the  distances  between 
cases  and  controls.  Any  individuals  with  distances  in  this 
metric  exceeding  the  99.9th  quartile  of  the  null  distribu¬ 
tion  of  distances  are  declared  unmatchable. 

Supplemental  Data 

Two  figures  are  available  at  http://www.ajhg.org/. 
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